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INTRODUCTION 


SomE account of the history of this five-volume course of higher 
mathematics has been given in the Introduction to Vol. I of the 
present English edition. 

The first Russian edition of the present volume appeared (in 1926) 
under the joint authorship of Professor Smirnov and the late Professor 
J. D. Tamarkin but later editions, prepared after Professor Tamarkin 
had settled in the U.S.A. and consisting of a drastic revision of (and 
many additions to) the original material, contained only Professor 
Smirnov’s name. This volume is made up of a course of advanced 
calculus which is of great use to students of mathematics and which 
provides the physicists and engineers with a complete set of those 
tools, based upon the theory of functions of real variables, which 
are indispensable for the study of the classical branches of mathe- 
matical physics. 

It consists essentially of five distinct parts, although there are 
strong links connecting all of them. There is a full discussion of the 
solution of ordinary differential equations with many applications to 
the treatment of physical problems. This is followed by an account 
of the properties of multiple integrals and of line integrals, with a 
valuable section on the theory of measurable sets and of multiple 
integrals. 

The mathematics necessary to the discussion of problems in classical 
field theories is discussed in a section on vector algebra and vector 
analysis; the methods developed are illustrated not only by applica- 
tions to physics but also by an account of the elements of differential 
geometry in three-dimensional space. 

After this there comes an elementary but full account of Fourier 
series. 

The principles and techniques developed in these sections are then 
applied to the discussion of the solution of the partial differential 
equations of classical mathematical physics. 

The clarity of Prof. Smirnov’s exposition and the width of his 
knowledge of the mathematical techniques effective in the study of 
the physical sciences makes the whole course a most valuable one for 
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the student anxious not only to learn the methods of advanced cal- 
culus but also to understand the influences which have motivated 


their development. 
I. N. SNEDDON 


PREFACE TO THE SIXTH EDITION 


Tuts edition of the second volume differs considerably from the pre- 
vious one. The first chapter of the previous edition, containing the 
theory of complex numbers, the principles of higher algebra, and 
integration of functions, was transferred to the first volume. At the 
same time, all material referring to the principles of vector algebra 
was taken from Volume I to Volume I. This material was incorporated 
in Chapter IV, together with vector analysis 

The presentation of the remaining chapters underwent substantial 
changes. This refers particularly to Chapters I, VI and VII. A special 
paragraph containing the theory of dimensions and the rigorous theory 
of multiple integrals was added to Chapter IIL. A certain re-distribution 
of material was carried out in Chapter VI, and a proof was added of 
the closure equation on the basis of Weierstrass’ theorem on polyno- 
mial approximation to continuous functions. Chapter VII now contains 
additional material on the propagation of spherical and cylindrical 
waves and Kirchhoff’s formula for the solution of the wave equation. 
The explanation of linear differential equations with constant coef- 
ficients is introduced at first without using the symbolic method. 

First paragraphs of each chapter have retained their explanatory 
character. The book is arranged in such a way that the basic material 
in larger type can be studied without reference to the examples or 
complementary theoretical material printed in small type. 

I should like to express my deep gratitude to Prof. Fikhtengol’ts, 
who has read the manuscript of this edition, for his valuable sugges- 
tions concerning the style and arrangement of the book. 


V. SMIRNOV 
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PREFACE TO THE FOURTEENTH EDITION 


THE GENERAL arrangement of the present edition is the same as that 
of the previous edition. However, small alterations were introduced 
in many places with the aim of clarifying the style and achieving 
greater readability. 

Most substantial changes were carried out in Paragraph 9 (Chapter 
Ii), “Supplementary remarks on the theory of multiple integrals”. 

In Chapter VU, devoted to simple problems of mathematical physics, 
the formulation of conditions for the solution of a series of basic 
problems was clarified. References to matters explained in detail in 
Volume IV have been added in several places in Chapter VII. 


V. SMIRNOV 
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CHAPTER I 


ORDINARY DIFFERENTIAL EQUATIONS 


§ 1. Equations of the first order 


1, General principles. A differential equation is defined as an 
equation which contains, in addition to independent variables and 
unknown functions, derivatives or differentials of the unknown 
functions [1, 51]. If the functions appearing in a differential equation 
depend on a single independent variable, the equation is called an 
ordinary differential equation. On the other hand, if partial derivatives 
of the functions with respect to certain of the independent variables 
appear in the equation, it is called a partial differential equation. 
We confine ourselves to ordinary differential equations in the present 
chapter, the greater part of which is devoted to the case of a single 
equation containing one unknown function. 

Let x be the independent variable, and y the required function of 
this variable. The general form of the differential equation becomes: 


Dz,y,y’y”,...,y™) =0. 


The order of the differential equation is defined as the order n of 
the highest order derivative of the function that appears in the equation. 
We shall consider ordinary differential equations of the first order 
in the present article. The general form of this equation is: 


D(x, y, y) =9 (1) 
or, on solving with respect to y’: 
y’ = f(x,y). (2) 
If a function 
y = 9(2) (3) 


satisfies the differential equation, i.e. if the equation reduces to an 
identity on replacing y and y’ by g(x) and g'(x), the function p(x) 
is said to be a solution of the differential equation. 
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The problem of finding a solution of a differential equation is 
alternatively referred to as the task of integrating the equation. 

If x and y are considered as the coordinates of points on a plane, 
differential equation (1) [or (2)] expresses a relationship between 
coordinates of points on a certain curve and the slopes of the tangents 
to the curve at these points. A curve corresponds to the solution (3) 
of the equation, the points and tangential slopes of which satisfy 
the equation. This curve is referred to as an integral curve of the given 
differential equation. 

In the simplest case, when the right-hand side of equation (2) does 
not contain y, a differential equation is obtained of the form: 


y’ = f(z). 
Finding the solutions of this equation is the primary task of the 
integral calculus [I, 86], and the total set of solutions is given by the 
formula: 
y = f f(x) dz + O, 

where C is an arbitrary constant. We thus obtain in this elementary 
case a solution of the differential equation containing an arbitrary 
constant. We shall see that a solution containing an arbitrary constant 
is also obtained in the general case of a first order differential equa- 
tion; such a solution is referred to as the general solution of the equa- 
tion. On assigning different numerical values to the arbitrary constant, 
we obtain the various so-called particular solutions of the equation. 

We give in the following sections some particular types of first 
order equation, integration of which Jeads to evaluation of indefinite 
integrals — or, as it may sometimes be expressed, their integration 
reduces to quadrature.t 


2. Equations with separable variables. On replacing y’ in equation (2) 
by the quotient dy/dx, multiplying both sides by da, and carrying all 
terms to the left-hand side, we can write (2) in the form: 


M(x, y) da + N (x, y)dy = 0, (4) 


which will be more convenient in some cases. Both variables x and y 
play an identical role here in the equation, so that (4) does not bind 
us to the choice of unknown function: we can take either 2 or y for 
this, as we wish. 


+ Evaluation of an integral has a direct connection with evaluation of an 
area, whence the term ‘‘quadrature’’. 
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We assume that each of the functions M(x, y) and N(x, y) consists 
of the product of two factors, one of which depends only on g, and 
the other only on y: 


M (x) M,(y) dz + N,(x) NA(y) dy = 0. (5) 


On dividing both sides of the equation by M(y) Ny(x), we reduce it 
to the form: 


M(x N 
Me Oe + egy ay =O (6) 
so that the coefficient of dz now depends only on a, and the coefficient 
of dy only on y. Equation (5) is called an equation with separable 
variables [1, 93], whilst the method itself of reduction to the form (6) 
is called separation of the variables. 
The left-hand side of equation (6) is the differential of the follow- 
ing expression: 


M(x) N(Y) 
may 8 + Janay & 
and the equating to zero of the differential of this expression means 
that the expression itself is equal to an arbitrary constant: 

ME) ay 4 | NW ay = — 

may 2+ | gy Y= 0) 

where C is the arbitrary constant. This formula gives an infinite se 
of solutions and, from the geometrical point of view, is the implici 
equation of a family of integral curves. If 
the quadratures are carried out in (7) and 
we solve the equation with respect to y, 
we obtain the explicit equation of the 
family of integral curves (the solution of 
the differential equation): 


y = p(z, C). 


Example. The area OAMN, bounded by the 
coordinate axes, the segment AM of a curve 
and its ordinate MN (Fig. 1), is equal in mag- 
nitude to a rectangular area OBON with the same base ON = x and with 
height 7: ; 

x x 
| y de = an; n=- [yaz (8) 
0 ò 
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The magnitude 7 is called the average ordinate of the curve in the interval (0, x). 
Let us find the curves whose average ordinates are proportional to the 
extreme ordinate NM. We have on the basis of (8): 


x 
f ydr = kay, (9) 
0 


where k is the coefficient of proportionality. On differentiating both sides of 
equation (9), we get the differential equation: 


y=ky+kay’, or ry’ = ay, (10) 
where 
l—k 


z (11) 


a= 


Unwanted solutions may have been introduced on differentiation, since the 
equality of the derivatives implies only that the functions themselves differ 


0=20-1 C= 


Fic. 2 Fic. 3 


by a constant. There are no unwanted solutions in the present case, however. 
It follows from equation (10), obtained by differentiation of (9), that both 
sides of (9) can only differ by a constant; but it is immediately evident that 
both these sides are zero for æ = 0, so that the constant in question is zero, 
ie. every solution of (10) is also a solution of (9). We pass to the integration 
of (10). It can be written as: 


and the variables can be separated: 
PE A 
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We obtain on integration: 
logy =alogz +0, or y=Cr’, (12) 
where C = e“ is an arbitrary constant. 

From (11), as k increases from 0 to +0, a decreases from + œ to —l, 
and we must therefore take a > — 1, so that the integral on the left-hand side 
of (9) never becomes meaningless. We have a = 0 for k = 1, and (12) gives 
as the obvious solution a family of straight lines parallel to axis OX. We have 
a = 2 for k = 1/3, which gives the family of parabolas (Fig. 2) 

y = Cr, 
for which the average ordinate is equal to a third of the extreme ordinate. 
With k = 2, we get the family of curves: 
_ 0 
Va’ 


for which the average ordinate is twice the extreme ordinate (Fig. 3). 


Y 


3. Homogeneous equations. A homogeneous equation is defined as an 
equation of the form: 
= f(4)f. (13) 


We preserve the previous independent variable x but introduce a 
new function u instead of y: 


y = xu, whence y’ = u +- xu’. (14) 


We transform our equation: 


u + zu’ = f(u) or z SY = ffu) — u. 


Separation of the variables gives: 


dx go ay: 


a u— f(u) 
We obtain, on denoting the coefficient of du by y(u): 


log x + f y, (u) du = Cy, 
whence 
x = Ce~Svidu or x = Cy(u), 


where C = e“! is an arbitrary constant. 


t We remark that the function p(x, y) of two variables is a function simply 
of the ratio y/z when, and only when, the magnitude of g(x, y) is unchanged 
on multiplying x and y by an arbitrary factor t, i.e. p(tx, ty) = p(x, y). This 
condition is equivalent to p(x, y) being a homogeneous function of x and y 
of zero degree [1, 151}. 
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On returning to the previous variable, we can write the equation 
of the family of integral curves in the form: 


z= Op (4). (15) 


We consider a transformation of similitude in the plane XOY 
with centre of similitude at the origin. The transformation amounts 
to the point (x, y) being transferred to the new position: 


xı = kx; yy=ky (k>0) (16) 


or, which comes to the same thing, it amounts to multiplying the 
length of the radius vector to every point of the plane by k whilst 
preserving the direction. If M is the original position of a point, and 
M, the position of the same point after transformation, we have 
(Fig. 4): 

OM, : OM =2,:4=y,:y =k. 


On applying transformation (16) to equation (15), we get the 
equation: 


xı = kCy (4). (17) 


which does not differ from equation (15), in view of the arbitrariness 
of the constant C, i.e. transformation (16) does not alter the total 
set of curves (15) but only moves one curve of family (15) to the posi- 
tion of another curve of the same family. Any curve of family (15) 
can evidently be obtained from one definite curve of the family by 
using transformation (16), with appropriate choice of the constant 
k. The result obtained can be expressed as follows: all the integral 
curves of a homogeneous equation can be obtained from one integral curve 
by means of the transformation of similitude, with centre of similitude at 
the origin. 
Equation (13) can be written as: 


tan a = f(tan 0), 


where tan a is the slope of the tangent, and @ is the angle formed 
by the radius vector from the origin with the positive direction of 
axis OX. Equation (13) thus establishes a connection between angles 
a and 0, such that the tangents to the integral curves of a homogeneous 
equation, drawn at the points of intersection of the curves with a straight 
line through the origin, must be mutually parallel (Fig. 4). 

It follows obviously from this property of the tangents that the 
transformation of similitude with centre of similitude at the origin 


3] HOMOGENEOUS EQUATIONS 7 


transforms one integral curve to another integral curve, since, on 
increasing the lengths of radius vectors of points of the curve in the 
same ratio, the directions of the tangents at the ends of the radius 
vectors are unchanged (Fig. 5). 


0 
Fie. 4 Fic. 5 


If we apply the above transformation to the integral curve consist- 
ing of a straight line passing through the origin, we get the same 
line after transformation, so that in this case the above method of 
obtaining the set of integral curves from one of them fails. 


Example. To find the curves such that the length MT of the tangent from 
its point of contact to its intersection with OX is equal to the length OT 
along OX (Fig. 6). 

The equation of the tangent has the form: 


Y —y =y (X — 2), 


where (X, Y) are the current coordinates of the tangent. We find the intercept 
of the tangent on OX by putting Y = 0: 


OT = g — 2 ’ 
Y 
and by hypothesis, MT? = ỌT?, which gives us [I, 77]: 


r +9 = (x ~ ty, 


whence we obtain the differential equation: 


, 2ey 
a? — y? ’ 


which evidently belongs to the homogeneous type. 
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We introduce a new function u instead of y, in accordance with the formula: 
yY = tu; Y = rw +u. 


We have, on substituting in equation (18): 


z x 2u du upu 
E e s u ae) 
which gives, on separating the variables: 
daz l— u? 


Integration gives us: 


æ(u? + 1) =C 


u 
or, on returning to the previous variable y: 
a? +y? — Cy = 0, (21) 


i.e. the required curves are circles passing 
through the origin and touching the axis OX at 
this point (Fig. 6). 

We divided both sides of equation (19) by 
(u + u?) in order to pass from (19) to (20), and 
we might have lost the solution u = 0, or, what 
comes to the same thing, y= 0. We see on 
substituting in equation (18) that this is in fact 

Fra. 6 a solution. The solution is contained in equation 
(21), however; we can obtain it by dividing both 
sides of (21) by C, then setting C = o. 

Each circle of family (21) can be got from any one of them by the trans- 

formation of similitude with centre of similitude at the origin, so that (Fig. 6): 


OM, OM, OM, _ 


ON, ON, ON, 


The differential equation: 


sy fez tbyte | 22 
da Sierra)’ (22) 


reduces to a homogeneous form, as we shall now show. We introduce new 
variables é and 7 in place of x and y: 


w=F+ta y=n+ß, (23) 


where a and # are constants which we proceed to define. 
Equation (22) becomes in the new variables: 


=! af + bn +aa+ dB +c ) 
a$ +bnt+taatbbta S 


4] LINEAR EQUATIONS; BERNOULLI'S EQUATION 9 
We define a and # by the conditions: 


aa+bB+ec=0, aatbd pte =O. 


The equation now reduces to the homogeneous form: 


n 
an _, a+ bie 
JE 7 

$ a tby 


Transformation (23) corresponds to parallel displacements of the axes, 
with the origin becoming the point of intersection (a, 8) of the straight 
lines 


axtby+e=0 and agz+ by +c =0. (24) 


The results previously obtained will thus be applicable to equation (22), 
with the only difference that the role of origin will be played by the 
point (a, 8). 

If the straight lines (24) are parallel, the transformation mentioned can- 
not be carried out. But in this case, as we know from analytic geometry, 
the coefficients in equations (24) must be proportional: 


Ho, and a,x + by = å (ax + by); 


on introducing a new variable u instead of y: 
u = ax + by, 
it can easily be seen that we obtain an equation with separable variables. 


We shall encounter below an extremely important application of 
homogeneous equations, to the investigation of fluid flow. 


4. Linear equations; Bernoulli’s equation, An equation of the form: 


y + P(z)y+QR(z)=0. (25) 


is called a linear equation of the first order. 
We start by considering the equation with no term Q(z): 


z +P (z)z=0. 
The variables are separable here: 
& + P(x)dzr=0, 


and we get: 
z = CeT SPO ax | (26) 
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We integrate the given linear equation (25) by using the method 
of varying the arbitrary constant, i.e. we seek a solution of the 
equation in a form analogous to the form (26) for z: 


y = wer SPO) dx, (27) 
where u is no longer a constant, but the required function of x. We get 
g q g 
by differentiation: 


y =u e SPMdx pP (a) ue- $ P(x) dx 
Substitution in equation (25) gives: 
u’ e~ SP@) 4x 1 Q(x) = 0 
a = —Q(x)eSPO%, whence u = C — f Q(x) SPO da, 
We finally get, by equation (27) for y: 
y =e SPR dx [O — (Q(x) SPO da]. (28) 


When determining y by this formula, we only need to take one each 
of the values of the indefinite integrals 


f P(z)dz and f Q(x) SPO% dx, 


since, the addition of arbitrary constants to these only changes the 
value of C. 

If we replace them by definite integrals with variable upper limits 
[I, 96], we can re-write (28) as: 


— §P(x)dx 


x (P(x) dx 
yo=e” [e — (Q(x) oà dz], (29) 


where 2, is a definite number, though chosen arbitrarily. On sub- 
stituting the value x = x, for the variable upper limit, the right- 
hand side of the formula written is equal to C, since integrals with 
identical upper and lower limits are equal to zero; in other words, 
the constant C in formula (29) is the value of the function y at x = ay. 
This value, which we denote by Yo, is called the initial value of the 
solution. 
We denote this fact by writing: 


y [sat =, Yo. (30) 
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If the initial value of the required solution is given for x = £o, (29) 
yields a completely defined solution of the equation: 


(31) 


— P(x) dx z P(x) dx 
Y =e a | aa}. 


Yo— [Q(x e* 
To 
Condition (30) is called the initial condition and is equivalent 
geometrically to the integral curve being sought which passes through 
the given point (£o Yo). 
If we take Q(x) = 0, we obtain the solution of the homogeneous 
equation 


y + P(x)y=0. 
satisfying condition (30): 
— FP(x) dx 
y=ye ™ (31) 


It follows from (29) that solutions of a linear differential equation 
have the form: 
y = pı (2) C + p (x), (32) 


i.e. y is a linear function of the arbitrary constant. 
Let y, be a solution of equation (25). On setting 


Y=y +z, 


we get the equation for z: 
a’ + P(x) z+ [yi + P (2)y + Q(e)] =0. 


The sum appearing in square brackets is equal to zero, since y, is a 
solution of equation (25) by hypothesis. It follows that z is a solution 
of the equation when the term Q (x) is absent and is defined by 
(26), whence: 

y = y, + CeT SP dx, (33) 


We now assume that a further solution y, is known of equation (25), 
and we let this solution be obtained from (33) with C = a: 
Ya = Yı + ae7 SPO) dx (34) 


If we eliminate e-$P® * from (33) and (34), we obtain an expression 
for the solution of a linear equation in terms of two of its solutions 
yı and y: 

Y = Yı + Ci (Y2 — Y1)» (35) 
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where C, is an arbitrary constant replacing C/a in the previous notation. 
The following relationship follows from (35): 


Y—-y .1-G o 

yY¥-" 0 a? (28) 
which shows that the ratio (y,—y)/(y—y,) is a constant, i.e. the family of 
integral curves of a linear equation is a family of curves that divide 
the segment of ordinate between any two curves of the family in a constant 
ratio. 

If two integral curves L, and L, of a linear equation are known, 
any other integral curve L is defined by the constant value of the 
ratio (Fig. 7) 

AA, _ BB, _ 0O, _ DD, 


AA BB OC DD 


It follows from this equation that 
chords A,B, AB and A,B, must 
either meet in a single point or be 
parallel. On letting B,B, approach 
indefinitely near to A,4,, the direc- 
tions of these chords become the 
directions of the tangents to the 
curves at A,,A and A,, and we 
obtain the following property of the tangents to the integral curves 
of a linear equation: the tangents to the integral curves of a linear 
equation at the points of intersection of the curves with a line drawn 
parallel to OY either intersect in a single point or are parallel, 


Fie. 7 


Examples. 1. We consider the transient current in a circuit with self- 
inductance. Let 7 be the current, v the voltage, R the resistance in circuit, 
and L the self-inductance. 

The following relationship is valid: 


whence we obtain the linear equation for 7: 
di R. v 
adh Ee e 
We take R and L as constants and v as a given function of time ¢, and evalu- 
ate the integrals appearing in formula (31): 


t t t t 
R R frat 1 
feas [a= fo d= — > fo dt. 
0 0 9 0 
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If we let 7, denote the initial value of 4, i.e. the value of the current at t = 0, 
(31) gives us the following formula for determining i at any required instant: 


— -t 
i= (i — -Ẹ)° L +>. 


The factor e-®#/L rapidly decreases as ¢ increases, and in practice the process 
an be assumed to have reached the steady state after a short space of time, the 
urrent being then given by Ohm’s law: i = v/R. 

In the particular case of 7, = 0 we get the formula: 


—7t 
< v L 
i=- ( —o ) (37) 
for the current in a closed circuit. 
The constant L/R is called the time constant of the circuit. 
We consider a voltage v of sinusoidal form, v = A sin wt. We obtain by using 
(31): 
A i 
TEN [a+ fer sin wt dt]. 
ò 


It is easily seen [I, 201] that: 


R 
=f =t 
L L RL y wL? 

fe sin wt dt =e [ wo? L? 4 R sin ot — -rpa pr coset] 


and therefore: 


2 
sin wt — a), cos at |+ 


w wL 
w? J2 + R? 


R 
t EJ RL 
wL? + Rt ` 


We obtain on substituting in the expression for 7: 


R 
i=(i p— 274 eo: RA ag wLA 
o t et R og R O eee 0S 


(38) 
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The first term, containing the factor e~®/L, is rapidly damped, and in practice 
the current will be given in a short space of time after ¢ = 0 by the sum of 
the two remaining terms of (38). This sum consists of a sinusoidal quantity of 
the same frequency w as the voltage v, but with different amplitude and phase. 
We also notice that the sum giving the steady-state current does not depend 
on the initial value of the current tą. 

2. The resistance R cannot be reckoned as a constant in switching processes, 
when a spark appears. It increases from an initial value R, to infinity (at 
the instant t of breaking contact). 

It is sometimes permissible to express the relationship between R and t 
by the formula: 


R=— R, w Ryt 
1— t t—t 
T 
This leads us to the equation: 
di R,T voo 
dt tG’ g Y 


To express ¢ in parts of r, we need to introduce a new variable x instead 
of t, according to the formula: 


t = TY, 


where x varies from x = 0 (initial instant) to x = 1 (the instant of quenching 
the spark, of breaking contact). The equation takes the form: 


EA + To i- F =0 (39) 
with the condition: 
; : : v 
isi ee). 
On applying (28), we easily obtain the general solution of the equation 
Rot Rot 
i= (1 =a) t | faa) tasto], 
where two cases can be distinguished: 
1) R ar 2) <r =t. 
We find in case 1): 
Ret 
i= Bre" =e) +00 —a)* 


and we determine the arbitrary constant C on substituting v = 0: 


; or apenn e 
w= prg t CS- Rae 
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and finally, 
Ror 
; vT ; vT Ep. 
i= la Ga- r) -a 3 (40,) 
We proceed similarly and find in case 2): 
i= (1—2) [i 10g (1 —a)]. (40,) 


Bernoulli’s equation is a generalisation of the linear differential 
equation (25): 

y + P (æ)y +Q(z)y"=0, (41) 
where the exponent m can be considered as differing from zero and 
unity, since the equation is linear in these cases. We divide both 
sides by y”™: 

yy’ + P (a)y + Q (x)= 0 
and we introduce a new unknown function u instead of y: 
u=y 7"; w = (1 —my™y’. 
The equation now reduces to the form: 
w = P,(a)u+Q,(z)=0, 


P, (x) = (1 — m) P(x) and Q, (x) = (1 — m) Q (a), 


i.e. Bernoulli's equation reduces to a linear equation by substituting 
u = y!-™ and is then integrated as a linear equation. 

We remark that integration of the differential equation of the 
form: 


where 


y+ P(z)y+Q(z)y? + R(x) =0, (41,) 
known as Riccati’s equation, does not reduce to quadrature in the 
case of arbitrarily chosen coefficients. It can reduce to a linear equation 
if any one particular solution is known. Let y(x) be in fact a solution 
of equation (41,), i.e.: 

Yi + P (zy. + Q(x) yi + R(x) =0. (*) 


We introduce into (41,) a new required function u instead of y, 
where 
1 
Y="HTT: 


u 


On substituting in (41,) and taking into account equation (*), we 
obtain a linear equation for u of the form: 


u’ — [P (x) + 29 (x) y,Ju — Q(x) = 0. 
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The general solution of this equation has the form: u =0 g(x) + y(x). 
If we substitute this expression for u in the equation for y written 
above, we get the general solution of Riccati’s equation in the form: 


Co, (x) + y(x) 
Opals) + y,(2) ` 


5. Finding the solution of a differential equation with a given 
initial condition. As we have said, a first order differential equation 


y = f(x, y) (42) 


consists of a relationship between the coordinates (x,y) of a point 
and the slope y’ of the tangent at this point. We assume that f(x, y) 
is a single-valued, continuous function of (x, y). A definite tangent 
with a slope equal to f(x,y) now 
corresponds, by (42), to any point 
of the plane at which f(x, y) is de- 
fined. On indicating the direction 
of this tangent by an arrow passing 
through the corresponding point, we 
arrive at a tangent field in the plane, 
every tangent being associated with 
some point of the plane. The integral 
curves of equation (42) are the curves 
the tangents of which are the tan- 
gents of the field and they may be 
designated the integral curves of the 
given field. 

The magnetic field at the earth’s 
surface may be taken as an example. If we regard a portion of the 
earth’s surface as a plane, the direction shown by the magnetic needle 
at each point gives us a definite tangent at every point. 

We now turn to the question of finding the integral curves of 
equation (42). The complete definition of the position of an integral 
curve requires the further assigning of some point through which 
the integral curve must pass, e.g. its intersection with the line x = vo 
parallel to OY; or, what amounts to the same thing, we must assign 
the initial value y, that the required function y must take for the 
specified value 7 = 2: 


Fic. 8 


y ane = Yo. 
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The integral curve passing through the given point (£o, Yo) can be 
drawn approximately by using Euler’s method, explained below. 

We mark out a mesh of small equal squares in the coordinate 
plane by lines parallel to the axes, then we draw from the origin, 
in the negative direction of OX, the intercept OP, of unit length 
(Fig. 8). We substitute x = x, and y = y in the right-hand side of 
equation (42) and having found the value of f(x», Yo), we mark off 
the intercept OA, equal to this value on the ordinate axis. The line 
PA, will evidently have a slope equal to f(x, Yo) and will therefore be 
parallel to the tangent to the integral curve at the point (£o, yp). 
We now proceed to the approximate construction of the integral 
curve itself in the form of a step line. 

We produce from the point (£o, Yo) a line M, M,, parallel to PA, 
and hence having a slope Yó = f(£os Yo). Let M (tı y,) be the first 
point of intersection of this line with any side of our square mesh. 
We cut off a segment OA, on the ordinate axis equal to f(x, y1), and 
produce through the point M (x, Yı) a line M, M,, parallel to PA, 
[and therefore having a slope yj = (2, y,)], to its first intersection 
at M,(22, y,) with a side of our square mesh, and so on. This con- 
struction can be carried out both in the direction of increasing, 
and in the direction of decreasing, abscissae. The step line obtain- 
ed in this way represents approximately the required integral 
curve. 

We further remark that a different scale can be used for drawing 
the intercepts OP and OA,, OA;, ... than is employed for the coordi- 
nates x and y, since the directions of PA, PA;,... are evidently 
independent of the choice of scale for the intercepts. 

This construction makes it clear by inspection that one and only 
one integral curve of equation (42) passes through a given point 
(£o Yo): 

This assertion is susceptible of rigorous proof if the function f(z, y) 
has properties in addition to continuity. For instance, if f(x, y) is a 
single-valued, continuous function of its arguments in the neighbourhood 
of the point (£o, Yo) and has a continuous derivative with respect to y, 
one and only one integral curve of equation (42) passes through the 
point (Xo, Yo). 

This theorem, which at present we accept without proof, is usually 
called the existence and wniqueness theorem for the solution of a 
differential equation with a given initial condition. The theorem is 
proved at the end of the next chapter. 
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We supplement the above geometrical explanation with an analytic 
explanation of the theorem in an important particular case, viz, 
in the case where the right-hand side of equation (42) consists of a 
series expansion into positive integral powers of the differences 
(@ — a») and (y — yo) [I, 161]: 


f(e, y) = X ay (€ — 2) (Y — Y), 


which is convergent if the absolute values of the differences are 
sufficiently small. 
Here, the solution of equation (42) satisfying the initial condition 


Yy pen = Yo, (43) 


can be written as a Taylor series in positive integral powers of the 
difference (x — z), the coefficients of the series being completely 
defined by equation (42). In fact, on substituting % = x and y = Yo 
in the right-hand side of (42), we obtain the value y, of the first 
derivative y’ at £x =g, We get on differentiating (42) with re- 
spect to x: 

y" = chs y) + tety; 
if we substitute «= £o Y = Yo y’ = Yo in the right-hand side of 
this equation, we find the value yọ of the second derivative y” at 
£ == £o Further differentiation of the equation written above with 
respect to x gives us an equation in y’’’ and so on. We thus determine 
the Taylor series: 


y= 4 +2 (7 — a) + Éen., (=) 


which in fact gives, for values of x near Xp, the solution of (42) satisfy- 
ing the initial condition (43). 

The method of undetermined coefficients may be used as an 
alternative to the above method of determining successively the 
derivatives at x = x). We replace y on both sides of (42) by a power 
series with undetermined coefficients: 


Y = Yo + 4, (£ — To) + Ay (4 — To)? +... (45) 


By expanding the right-hand side in powers of (x — £o) and 
equating coefficients of like powers of (x — £o), the coefficients 
ai dp ... can be successively determined. It can easily be shown 
that series (44) and (45) are identical. 
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Example. We find the solution of the equation: 


yap, (46) 
satisfying the initial condition: 
A Y lxo = 1, (47) 
as a power series: 
y=1lt+ 5 ast, 


where the constant term has been taken epual to unity in view of the initial 
condition (47). 
We differentiate the series: 


We substituto these expressions for y and y’ in equation (46): 
a, + 20,2 + 3a H.. (n+ ]) ana +... = 
1 


w(l-a,e4+a,¢%*-+...4+ an, 2" 14...). 


to] 


We equate coefficients of like powers of x on both sides and obtain 
the relationships shown in the table. It is 


clear from these that 
x a, =0 
Qi = Oy = dg =... = Agqyy = +. =0;3 l 
gl 2a = > 
E NIAE: EEE EE oe 
e a T T g? 3a, = 5 a 
‘ A l 
i.e. finally [1, 126] x3 4a, = za 
x ] 2 \2 l 2NS — be w «© © © @ 2 © © © ew 8 
y=1+F4+5-( EE: 
4° 24 3r (a A 1 
x (n + 1) an = -F Gna 
l x? n 7 re Se sol 
++ a) +...=0. 


6. The Euler—Cauchy method. The approximate construction for an 
integral curve of equation (42) given in the previous article can be 
simplified by using lines only, parallel to OY, instead of the mesh 
of squares. This modified Eulerian method results in a relatively 
simple and handy means of evaluating approximately the ordinate y 
of an integral curve for a previously assigned abscissa x. 
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Let M,(%o, Yo) be the initial point of the integral curve (Fig. 9). 
We produce a line with a slope f(x, Yo) from this point to its inter- 
section with the line x = x, parallel to OY, in the point M,. Let 
y, be the ordinate of M,. It is evidently given by the relationship: 


Yı — Yo = Í (Lo Yo) (41 — T0)» 


since M,N and NM, are given by the numbers (%,—2,) and (y,—y,), 
whilst the tangent of angle NM,M, is equal to f(x, Yọ) by con- 
struction. 

We draw M, M, with slope f(x, y,) from the point (z, yı) to its 
intersection at M, with the next line x = x, parallel to OY. The 


ordinate of the point of intersection will be given by a relationship 
similar to the above: 


Ya — Yı =F (21 Y1) (22 — 2). 


Proceeding in the same way from the point M,(2, Y2), we can 
next obtain the point M,(2., y,) and so on. The lines PAo, PA, .. 
have the same role in Fig. 9 as in Fig. 8. 

We now suppose that, for a given value of x, we have to find the 
value y of the solution of equation (42) that satisfies the initial 
condition (43). By what was said above, we must proceed as follows: 
we subdivide the interval (x9, x) by points: 


Lo < Ly < Ta <L g L... <M L En LT... (48) 
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and then obtain the ordinates y,, Yz -- ., Yn—, in accordance with the 
formulae: 

Yı — Yo = Í (Lo Yo) (21 — To) 

Ya — Yı = F (21, Y1) (£2 — %) 


~Y: = f (22, Ya) (25 —") Ta) (49) 


Yn—1 — Yn- = Í (Tn—2 Yn—2) (n1 So n-a) 
Y— Yn- = Í (Enis Yn—1) (x — En). 


With the conditions laid down in [5] for the properties of the 
function f(x, y), if the number of sub-intervals increases, with each 
tending to zero, the quantity Y obtained from (49) will tend to the 
true ordinate y of the required integral curve provided the given x 
is sufficiently close to the initial x, 

We easily find, on adding equations (49) term by term: 


Y ~ Y =Y + Í (2o Yo) (%1 — Vo) + f (2 Y1) (Ta — 2) +... + 
+ Í (Zn-2 Yn—2) (£n: a Ln—2) oF f (En Yn-1) (x aj Ln—1) ž (50) 


In the elementary case of the equation: 


y’ = f (x) 


the formula written takes the form: 
n—-1l 
Yo + >f (£s) (s41 = s) : 
8=0 


which, as we know [I, 87], gives an approximate expression for the 
value of the integral y, + J f(x) dz, i.e. for the solution of the given 
equation. 

The computation in accordance with (49) is carried out in the 
following order. The first of equations (49) gives thedifference (yı — Yo). 
We add this to yo, obtain the second ordinate y, then find the difference 
(Y2 — y,) with the aid of the second of equations (49). We add this 
latter to Yy, obtain the third ordinate y, then find (y — y,) with the 
aid of the third of equations (49), and so on. We find Y by adding all 
these differences to yo. 


Example. We apply the method given to the solution of equation (46) with 
initial condition (47). We shall take all the intervals (£o 2), (£1, %2) -.. equal 
to 0.1. 
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z y z 4y = 21.01 et 

0 1 1 0 l 

0.1 1 l 0.05 0.005 1.0025 
0.2 1.005 0.1005 0.0101 1.0100 
0.3 1.0151 0.1523 0.0152 1.0227 
0.4 1.0303 0.2061 0.0206 1.0408 
0.5 1.0509 0.2627 0.0263 1.0645 
0.6 1.0772 0.3232 0.0323 1.0942 
0.7 1.1095 0.3883 0.0388 1.1303 
0.8 1.1483 0.4593 0.0459 1.1735 
0.9 1.1942 0.5374 0.0537 1.2244 


The results of the computation are shown in the accompanying table. The 
first column contains v, the second contains the corresponding y, the third 
the value of f(x, y), ie. xy/2, the fourth the difference Ay = y,,, — Ys, and 
the last the value of the ordinate of the accurate integral curve y = e"f, 

As can be seen from the table, the error with z = 0.9 is less than 0.031, 
i.e. amounts to roughly 2.5%. 


7. The general solution. On altering the value of y in the initial 
condition: 
Ylx=xo = Yo 
we obtain an infinite set of solutions of equation (42), or in geometrical 
terms, a family of integral curves depending on the arbitrary constant 
Yo, this being the ordinate of the point of intersection of an integral 
curve with the line + = x). Instead of appearing in the solution as 
the initial value of y, the arbitrary constant can also appear in the 
general form: 
y = g(x, C). (51) 
Such a solution of (42), including the arbitrary constant, is called 
the general solution of the equation, as already mentioned [1]. It can 
also appear in implicit form: 


y (z, Y, C) =0. (52) 
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If we assign a definite numerical value to the constant C, we obtain 
a definite solution of (42), which is referred to as a particular solution 
of the equation. To distinguish the curve passing through a given 
point (£a Yo) from the family of curves of the general solution (52), 
we have to find the numerical value of C from the condition: 


(Zo. Yor C) = 0. (53) 

The following is the converse of the problem of integrating a first 

order differential equation: given the family of curves (52), depending 

on a single parameter C, it is required to form the differential equation 
for which this family is the family of the general integral. 

We get on differentiating the given equation (52) with respect to z: 


dy(z, y, C) plz yC) p 

= ae + ETE y =0. (54) 

Elimination of parameter C from equations (52) and (54) gives us 
the required differential equation of family (52): 


D(z, Y, y’) =0. 


After solution with respect to the arbitrary constant, the general 
solution (52) can be written in the form: 


w(x, y) = C. (55) 


We obtain the general solution in this form in the case of the 
equation with separable variables [2]. The function w(x, y) on the 
left-hand side of (55), is called a solution of the differential equa- 
tion (42). 

We must obtain a constant on substituting any particular solution 
of (42) for y in this function, i.e. the solution of (42) is a function 
of x and y such that its total derivative with respect to x is zero, by virtue 
of (42). 

On taking the total derivative with respect to x of both sides of 
equation (55), we get [I, 69]: 

Sole, y) 4+ eea y’ =0, 
or, on replacing y’ by f(x, y), inasmuch as y is a solution of (42) by 
hypothesis, we have: 


O(a, y) 
Ox 


+CV f(x,y) = 0. (56) 
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The function w(x, y) must satisfy this equation independently of the 
precise solution of (42) that we have substituted in this function. But 
in view of the arbitrariness of the initial condition (43) in the exist- 
ence and uniqueness theorem, we can take any values we please of 
x and y, provided we take all the solutions of equation (42), i.e. 
the function w(x, y) must satisfy equation (56) as an identity in x and y. 
We finally show how a solution of equation (42) can be checked 
when it is given implicity: 


w, (x, y) = 0. (57,) 


We obtain as above the equation: 


3 A Oe, (x, 
OE) 4 SY iwy) = 0, (57,) 
which must be satisfied at all points of curve (57,), i.e. equation (57,) 
is to be satisfied only by virtue of (57,) and not as an identity in x 
and y: in short, (57,) must be a consequence of (57,). 


Example. 
We take, for instance, the equation: 
, 1-32 y 
a Qxy i 


It is easily shown that the circle: 
a? + y?—1=0 
is a solution of this equation. Here, in fact, f(x, y) = (1 — 3a% — y’)/2ay and 
w(x, y) = x? + y? — 1, so that (57,) reads: 
I~ 3at?—y? _ F 
2e + 2y — oy — =0, ie z 


which is evidently satisfied by virtue of the equation of the circle. We show 
that the general integral of the given differential equation is: 


l-r- 


0, 


w+ vy? — x = C. 
We get by substituting in (56) w,(x, y) = 23 + zy? — v: 
1 — 3a? — y? 
2ay 


and it is obvious that this equation is satisfied identically for all x and y. 


3x? + y? — 1 + Qry = 0, 


Let the differential equation be given implicitly with respect to y’: 
Biz, y, y’) = 0. (58) 
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If we solve it for y’, we reduce it to form (42), though f(x, y) can 
now be a many-valued function. We suppose that the function has m 
different values, so that there are m different values of y’ for a given 
x and y, i.e. instead of a single tangent corresponding to a given 
point, we have m different tangents. As a result, we now have m 
different tangent fields in the plane instead of one tangent field. 
An integral curve passes through a given point for each of these 
fields, so that altogether m integral curves of equation (58) will pass 
through the given point. Yet the general integral of (58) will contain 
only one arbitrary constant, i.e. will have the form (52); on the 
other hand, equation (53) must in general give m distinct values, 
and not one value, for C. 


We make up an example in connection with these last remarks, where 
the solution containing an arbitrary constant is not strictly speaking the general 
solution. We take the differential equation: 


y? — ay’ = 0. (59) 
The left-hand side can be factorized, giving y’(y’ — x) = 0, so that in essence 


we have two distinct differential equations: 


y =Oandy — x = 0, 
with general solutions 
and 


y-a- C0. (593) 
The last two equations can be combined: 
(y—6)(y—-2t—c) =0, 


giving the general solution of equation (59). Two integral curves pass through 
every point of the plane: the straight line (59,) and the parabola (59,). Evidently 
(59,), y =C, gives a solution of (59) containing an arbitrary constant; this 
solution is not the general solution of (59), but only the general solution of the 
equation y’ = 0. 


Equation (42), or (58), can have a solution which is not contained 
in the family of the general solution, i.e. cannot be obtained from 
(52) with some particular value of constant C. Such a solution is 
called a singular solution of the equation. We go into the problem 
of finding such solutions, and their geometrical interpretation, in [10]. 

Strictly speaking, the concepts of solution and general solution are 
in need of further explanation. We shall not go into the matter, 
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however, inasmuch as the existence and uniqueness theorem for 
the solution with a given initial condition is a more natural basis 
for a theoretical treatment of differential equations. Finding the 
general solution, as described above for a particular type of equation, 
certainly offers a very useful practical means of constructing the 
solutions of differential equations. We remark here that if, on passing 
from the differential equation to its general solution, we at no step 
violate the equivalence of succesive equations, there can be no 
singular solutions, i.e. every solution is contained in the general 
solution, on assigning various numerical values to C. In the case 
when the equivalence of the equations is lost, the singular solutions 
must be sought among the missing solutions, as will be done in [8] 
and [9]. 

By general solution is naturally understood a solution of the dif- 
ferential equation containing an arbitrary constant, from which can 
be obtained all the solutions defined by the existence and uniqueness 
theorem for initial conditions filling a certain domain of the (x, y) 
plane. This domain is determined by the function f(z, y) appearing 
in equation (42). It is natural to describe solutions of the differential 
equation as singular solutions when they have the property that the 
conditions guaranteeing the existence and uniqueness theorem are 
not fulfilled at any point of the corresponding integral curve. All 
these definitions require certain assumptions, of course, regarding the 
function f(z, y) or D(x, y, y’) appearing in equation (42) or (58). 

On replacing y’ by the arbitrary constant C, in equation (42) or 
(58), we get the family of curves: 


f(z, y) STR C; or D(x, Y, Cı) =0. 


Each curve of this family is the locus of points of the plane 
which are associated with the same slope, the family as a whole 
being referred to as a family of isoclines of the given differential 
equation, i.e. a family of curves of the same slope. In the particular 
case of the magnetic field at the earth’s surface, the isoclines are 
lines along which the direction of the magnetic needle is constant. 

The isoclines for the homogeneous equation of [3] were lines 
passing through the origin. 

We shall note the cases in which an isocline is an integral curve 
of the equation, i.e. gives a solution of the equation. We take the 


isocline: 
P(x, y, b) = 0, 
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corresponding to the particular value C} = b. At points of the 
isocline, the differential equation gives the same slope, inasmuch 
as y’ =b. A necessary and sufficient condition for the isocline to 
be a solution is that the tangent to the isocline is also of slope b at 
every point of it — whence it immediately follows that the isocline 
must be a straight line of slope b, since y’ = b gives y = br +c, 
where c is a constant. Hence, an isocline is a solution only when it ts 
a straight line and when the direction of this line coincides with the 
constant direction of the tangents, as defined by the differential equation 
at points of the isocline. 


Example. To find the curves for which the length of the normal MN is a 
constant a (Fig. 10). Use of the expression for the length of the normal [I, 77] 
gives us the differential equation: 


+ yVI+y?=a. (60) 
We get by squaring both sides of the equation and solving with respect 
to y’: 
dy, Jak — y? 
“ae £ a (81) 


The right-hand side of the last equation is only defined for | y| < a, ie. 
in the strip between the lines 


‘y=a and y= — a, (62) 
since otherwise the expression under the square root is negative; at every point 


inside the strip, y’ has two distinct values. 
The variables are separable in equation (61): 


y dy 


We eagty find on integrating: 
(2— CP +y =a, : (64) 


i.e. the family of circles with centres on OX and radius equal to a (Fig. 10). 
All these circles are situated in the strip bounded by the straight lines (62), 
with two circles of family (64) passing through every point inside the strip. 
The transition from equation (61) to (63) required division by ya? —y?, 
and as a result of this the solution y = +a might have been lost. It is easily 
seen by direct substitution that this is in fact a solution of (61). The solution 
is represented geometrically by the lines (62), which are not included in the 
family of the general solution (64); in other words, the solution cannot be found 
from (64) whatever the value of constant C, i.e. it is a singular solution. 
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Substitution of the constant C, for y’ in equation (60) gives us the family of 


isoclines: 
+y¥l+ Ci =a, 


These are lines parallel to OX. The tangents to circles (64) along these lines 
maintain a constant direction. 

The lines (62), in particular, are also isoclines, with y’ maintaining a constant 
value zero along them, which coincides with the slope of the lines themselves: 
so that the lines are at the same time solutions of equation (61). 


Teg a s 


Inside the strip given by the lines (62) we have two differential equations 
(61): one corresponding to the (-+) sign and the other to the (—) sign. The 
circles (64) inside the strip are obtained in accordance with the existence and 
uniqueness theorem. The theorem becomes inapplicable at points of the lines 
y = +a, and these represent singular solutions of equation (60) or (61). 


8. Clairaut’s equation. An equation of the type 
y = xy’ + oly’) (65) 


is called a Clairaut equation. Substitution of an arbitrary constant C 
for y’ gives us a family of isoclines of the equation: 


y = xC + 9(C). (66) 


Every isocline is seen to be a straight line, with slope equal to the 
constant that we substituted for y’, i.e. the direction of each of lines 
(66) is the same as the constant tangential direction defined by the 
differential equation at points of the line. Recalling what was said 
in the previous article, we can assert that each of lines (66) is also 
a solution of equation (65), i.e. the family of isoclines (66) is at the 
same time the family of the general solution of (65). 

We now indicate a second method of obtaining the general solution 
of equation (65), whereby the singular solution of the equation is 
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found, as well as its general solution. We use the notation y’ = p, 
and re-write (65): 
y = vp + 9(p). (67) 


It amounts to finding p as a function of v, say w(x), so that on 
substituting p = w(x) on the right-hand side of (67) we get for y 
a function of x such that its derivative y’ is: y’ = p = w(x). We 
take differentials of both sides of (67), expand the left-hand side as 
dy = y’dz = pdx, and obtain the first order differential equation 
for p: 

pdx = pdx + zdp + y’(p)dp or [z+ p (p)]dp = 0. 


We get two cases on equating each factor to zero. The case dp = 0 
gives p = C, where C is an arbitrary constant; substitution of p = C 
in equation (67) again gives us the general solution (66). In the second 
case we have the equation: 


a + g (p) =0. (68) 


On eliminating p from (67) and (68), i.e. from the two equations: 


y =xp + p(p) and æ+ g (p) = 0, (69) 


we likewise obtain a solution of equation (65), which does not contain 
an arbitrary constant. This is usually a singular solution of the 
equation. 

The geometrical problem of finding the curve, given the properties 
of its tangent, reduces to Clairaut’s equation, assuming that the 
properties relate only to the tangent itself, and not to the point of 
contact. The equation of the tangent has the form: 


Y —y =y (X — z) or Y =y X + (y — ey’), 


and any properties of the tangent are expressed by a relationship 
between (y — xy’) and y’: 


Py ~~ xy’, y’) =0. 


On solving with respect to (y — xy’), we arrive at an equation of 
the form (65). The straight lines composing the general solution of 
Clairaut’s equation are evidently of no interest as regards providing 
an answer to our geometrical problem, the answer being in fact 
given by the singular solution of the equation. 
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Example. To find the curve such that the intercept 7,7, cut off its tangent 
by the coordinate axes is of constant length a (Fig. 11). 

The equation of the tangent gives us the projections OT, and OT, of the 
tangent on the coordinate axes, and this enables us to write the differential 
equation of the required curve as: 


, 


(y — vy’? , , ay 
A Ht (y — vy’)? =a? or say + >=. 
y7 (y — vy’) y = ty Wiig 
The general solution is: 
ac 
mea cre im 


consisting of a family of straight lines, the length of the intercepts of which 
on the axes is equal to a. The singular solution is obtained as a result of eliminat- 
ing p from: 
ap 
= xq] +- 71 
yaad + a (71) 
and from the equation 


2 
oT ee 
YIFP o 
1+ p? ' 
which reduces to: 
eat toh 


(1 F pi) 
We write p = tan 9, giving 


xt 


x = F acos? gp 


whilst equation (71) for y gives us: 

y= F a cos? ọ tan ọ + a sin ọ = + a sin? g 
We eliminate g by raising the last two equations to the power 2/3 and 

adding: 


2 2 2 


a + y? — a , 
i.e. the required curve is an astroid, which we mentioned in [1, 80]. The 
straight lines (70) form the family of tangents to it (Fig. 11). 


9. Lagrangian equations. An equation of the form: 
y = wp, (y’) + p2 (4°). (72) 


is called a Lagrangian equation, 9,(y’) being assumed different from y’; 
if p(y’) = y’, we get the Clairaut equation just described. 

We use the same method of differentiation for (72) as for the 
Clairaut equation. We write y’ = p, so that the equation becomes 


y = xp, (p) + P2 (P). (73) 
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We take the differentials of both sides and get a first order equation 


for p: 
pdx = g; (p) dx + zg; (p) dp + p: (p) dp. 


Division by dp gives us the equation: 
d ’ , 
[v1 (P) — P] -3p + (P) e +o; (p) = 0, 


which, on the assumption that v is a function of p, is a linear differential 
equation. We reduce this to the form (25) by dividing both sides by 
[p(p)— p], and obtain its general solution in the form: 


x = y (p) C + 2 (p). (74) 


Substitution of this expression for v in equation (72) gives us an 
equation for y of the form: 


y = ys (P) C + v4 (p). (15) 


Equations (74) and (75) express v and y in terms of an arbitrary 
constant C and a variable parameter p, i.e. give the general solution 
of the Lagrangian equation in parametric form. On eliminating 
parameter p from (74) and (75), we get the ordinary equation for the 
general solution. 

When dividing the equation by dp, we may have lost the solution 
corresponding to dp = 0, i.e. corresponding to constant p, or what 
amounts to the same thing, to constant y’. But constant y’ leads 
to a first degree polynomial for y, i.e. the missing solutions must 
be straight lines, if they exist. We also note that, for constant p = a, 
(73,) gives a dz = 9,(a) dx, i.e. the value of the constant a must be 
defined by the equation ¢,(a) — a = 0. 

We give the geometrical interpretation of this last fact. Sub- 
stitution of constant C, for y’ in equation (72) gives us the equation 


of the isoclines: 
y = zo, (Cy) + P2 (C3), (76) 


i.e. the isoclines of a Lagrangian equation are straight lines. The solutions 
which are represented by straight lines have to be sought among the 
isoclines. For this, we have to establish the condition that the slope 
9,(C,) of the isocline is the same as the constant slope O, of the tangent 


along the isocline: 
pı (Cı) — C, = 0. 
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On solving this equation and substituting the value found for C, 
in equation (76), we obtain the required solutions, among which 
must be included the singular solution in question. 


10. The envelope of a family of curves, and singular solutions. 
We have already had two examples in which singular solutions were 
obtained in addition to the general solution. The general solution in 
the example of [7] consisted of the family of circles 


(z — 0} +y =a? (17) 


with centres on OX and of fixed radius a. 

The two lines y = +a, parallel to OX, were singular solutions. 
Any given point of these lines is a point of contact with a circle of 
family (77) (Fig. 10). The general solution in the example of [8] 
consisted. of a family of straight lines whose intercepts cut off by 
the coordinate axes were equal in length to the given a, whilst the 
singular solution was the astroid, such that any given point of it 
was a point of contact with one of the lines concerned, i.e. the family 
of straight lines was a family of tangents to the astroid. 

These examples lead us naturally to the concept of the envelope 
of a family of curves. Let the family of curves 


y(z, Y, C) =0, (78) 


be given, where C is an arbitrary constant. The envelope of the family 
is defined as the curve, every point of which is a point of contact with a 
curve of the family, i.e. the tangent at any given point of the envelope 
is also a tangent to the curve of family (78) that passes through this 
point. 

We derive the rule for finding the envelope. We start by finding 
the slope of the tangent to a curve of family (78). We differentiate 
equation (78), whilst taking into account that y is a function of 2 
and C is a constant; this gives us 


dp (x, y, O) Opa, y, C) dy __ 
are aye aw 


whence {I, 69]: 


yo (79) 
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We assume that the required equation of the envelope is 

E (x,y) = 0. (80) 
We can suppose that the left-hand side of this equation, R(x, y), 
which is as yet unknown, has the form y(x, y, C), where C, instead 
of being a constant, is some unknown function of x and y. For any 
given function R(x, y), in fact, we can write the equality 


Riz, y) = v(x, Y, C), 
which defines C for us as a function of x and y. In other words, we 
can look for the equation of the envelope in the form (78), except 
for C being a required function of x and y instead of being a constant. 
We differentiate both sides of (78), and obtain, since C is no longer 
constant: 


YC Y, C Y, 0 
IOLO arp PELO gy 4 EU) aco (81) 


The slope dy/dx of the tangent to the envelope must, by hypothesis, 
be the same as that of the tangent to the curve of family (78) that 
passes through the same point, i.e. equation (81) must give us 
equation (79) above for dy/dx; but this can only be the case when 
the third term on the left-hand side of (81) vanishes, i.e. when 
(3y(x, y,C)/aC) dC =0. The possibility dC =0 gives us constant O, i.e. 
a curve of the family and not the envelope; so that to obtain the 
envelope we must put 
i Op(a, Y, C) Za 0 

oc : 


This equation also defines C as a function of (x, y). Substitution 
of the expression obtained for C in terms of x and y in the left- 
hand side of (78) gives us the equation (80) of the envelope, i.e. 
the equation of the envelope of family (78) can be obtained by eliminating 
C from the two equations: 


, Y, C 
pæ un C)=0; 2AL Lo, (82) 


As we move along the envelope, we touch different curves of family 
(78), each curve being defined by its value of constant C; this makes 
it clear why the equation of the envelope was sought in the form (78), 
with C, however, taken as variable. 

We now turn to the singular solution of a differential equation. 
We let (78) be the family of the general solution of the differential 
equation: 

p (z, Y, y’) =0, (83) 
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i.e. the coordinates (x, y) and slope y’ of the tangent for any given 
curve of family (78) satisfy equation (83). At every pointof the envelope 
x, y and y’ will coincide with the v, y and y’ for some curve of 
family (78), i.e. the v, y and y’ of the envelope will also satisfy (83). 
In other words, the envelope of the family of the general solution is also 
an integral curve of the equation. 

If y(x, y, C) = 0 is the general solution of equation (83), elimination 
of C from equations (82) leads us to a singular solution in certain 
cases. We add the proviso here, ‘‘in certain cases” (and not always), 
due to the following considerations. It was assumed in the above 
arguments that curves (78) have tangents; therefore, if we eliminate 
C from equations (82), it is possible for us to obtain not only the 
envelope, but also the set of all the singular points of the curves of 
family (78), at which the curves do not possess definite tangents 
[I, 76].. Furthermore, it sometimes happens that the envelope itself 
enters into the constitution of family (78). We shall not give a rigorous 
treatment of the theory of envelopes and singular solutions. The 
theory must obviously be closely connected with the existence and 
uniqueness theorem, mentioned in [5]. We confine ourselves to 
explaining the problem in a few examples. 


1. We seek the envelope of the family of circles (77): 
(c— OF +p =a. 
Equations (82) here take the form: 
(x — 0} +y =a: —2(e@—C)=0. 


The second equation gives C = xv, and substituting this in the first equation gives 
us y2? = a?, ie. the set of two straight lines y = +a, which we obtained pre- 
viously. 

2. The general solution of Clairaut’s equation y = vy’ + p(y’) is 


y=2C + g (C). 
The envelope is obtained by eliminating C from the two equations: 
y=xC+9(C); 0=@+ g (C). 


These equations coincide with equations (69) of [8], with the trivial replacement 
of the letter p by C, i.e. we get the previous rule for finding the singular solution 
of Clairaut’s equation. 

3. The curve y? =g? is the so-called semicubical parabola (Fig. 12). On dis- 
placing the curve parallel to OY, we get a family of semicubical parabolas: 


(y +O) =a. 
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Each of these curves has a cusp on OY, and there exists a right-hand tangent 
at the cusp, parallel to OX. Equations (82) here take the form: 


(y+O)=2; 2y+O)=0. 


Elimination of O gives us æ% = 0, i.e. axis OY. Axis OY is not the envelope 
in this case, but the locus of singular points of curves of the family. 
4. We consider the family of curves 


y =0 (2 —0}. 


Fie. 12 Fie. 13 


We have a parabola for C + 0, and the axis OX for C = 0. Equations (82) 
become: 
y = C(a—C)*; (#—C)(#—3C)=0. 


The second equation gives C =a or C = 2/3. Substitution in the first 
equation gives us either y = 0 or y = 423/27. The first curve y = 0 is axis 
OX, which belongs to the given family of curves; whereas the cubical parabola 
y = 423/27 is the envelope of the family. 

5. We take the chords of the circle of unit radius, centre at the origin, that 
are perpendicular to OX and we draw fresh circles with the chords as diameters, 
thus obtaining a family of circles. If v =C is the point of intersection of a chord 
with OX, the square of the radius of the corresponding circle is (1 — 0?) (Fig. 
13), so that the equation of the family is: 


(x — 0} + y2=1~— C2. 
Differentiation with respect to C gives us the equation: 


— 2 (æ — 0) = — 20; 
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on eliminating C from the last two equations, we get the equation: 
x? 
2 +p = 
2 y=l, 


ie. we obtain an ellipse with semi-axes y2 and 1, with the coordinate axes 
as axes of symmetry. It is obvious from the figure that this ellipse does not touch 
all the circles of the family. 


11. Equations quadratic in y’. We consider in more detail, from the point 
of view of singular solutions, differential equations that are quadratic in y’: 


D (x, Y, y’) ea y”? + 2P (x, y) y + Q (x, y) =0, (84) 


where P(x, y) and Q(x, y) are single-valued and continuous, and have continuous 
derivatives with respect to y, throughout the domain; e.g. they may be poly- 
nomials in x and y. We obtain on solving with respect to y’: 


y’ =— P (x, y) 4+ VR (x, y) (85) 


where we have taken R(x, y) = [P(x, y) — Q(z, y). In the part of the domain 
where R(x, y) > 0, (85) is equivalent to two differential equations, and in 
accordance with the existence and uniqueness theorem, two and only two 
integral curves will pass through every point of this part of the domain. Dif- 
ferential equation (84) will have no singular solutions in this region. In the 
region where R(x, y) < 0, equation (85) does not yield a real y’, and there are 
no integral curves in this region. Finally, we consider the equation 


R(x, y)=0, (86) 


which can define one or more curves in the domain. It is only among these 
curves that singular solutions of equation (84) can be found. We remark that 
(86) can be obtained by eliminating y’ from (84) and the equation: 


=0. ie y’+P(a, y)=0. 


The latter equation expresses the fact that (84) has a multiple root with respect 
to y’. 
1. In the case of the equation 


y=ry+y%, ie y?+2y’—-y=0 


(86) takes the form x?/4 + y = 0, andthe parabola y = — x?/4 is a singular 
solution of the Clairaut equation written. 
2. In the case of the equation 


y? + 2ry +y=0 


(86) gives y = x?. This parabola does not satisfy the equation written, so that 
the latter has no singular solution whatever. 
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12. Isogonal trajectories. An isogonal trajectory is defined as the 
family of curves intersecting the curves of the family 


y(x, y, C) =0 (87) 
at a given angle. 

If the given angle is a right angle, the trajectory is called the 
orthogonal trajectory. We show that finding an isogonal trajectory 
leads to integrating a first order differential equation. 

On eliminating C from the equations: 


’ ,Cc 3 , 0 , 
v(t, y C)=0; FELO 1 VEU yo, 


we obtain the differential equation of the given family (87) as in [7]: 
D(x, y, y)=0. (88) 


We start by finding the orthogonal trajectory. In this case, the 
tangents to the required curves are perpendicular to the tangents to 
the curves of family (87) at the points of intersection of the curves, 
i.e. the slopes of the tangents to the trajectory are the reciprocals, 
with reversed sign, of the slopes of the tangents to the given family. 
Hence it follows that, to obtain the differential equation of the orthogonal 
trajectory, we must replace y’ by (— 1 ly’) in the differential equation of 
the given family. 

Finding the orthogonal trajectory thus reduces to integration of 
the equation: 


D (x, Yis —~=)=0, 


where y, is the required function of x. 

We now turn to the general problem of isogonal trajectories. Let 
p be the constant angle at which the curves of the trajectory intersect 
the curves of family (87). Let y, denote, as before, the ordinate of 
the required curve; on using the formula for the tangent of the 
difference of two angles: 


_ tany, — tany, 


pen (Wir) =T eny iano? 


where tan y = y’ is the slope of the tangent to a curve of (87) and 
tan y, = y’,is the slope of the tangent to the required curve, we 
can write 

` yi y 


IFy y = tan P, (89) 


38 ORDINARY DIFFERENTIAL EQUATIONS [12 


where ¢ is measured from curve (87) to the required curve. On eliminat- 
ing y’ from the last equation and equation (88), we obtain the 
differential equation of the isogonal trajectory, which then has to be 
integrated. 


We come across orthogonal trajectories when considering plane fluid flow. 
We assume that the fluid flow takes place in a plane, so that a vector v, the 
velocity of motion, is defined at every point (x, y) of the plane. If the velocity 
vector depends only on the position of the point in the plane, and not on time, 
the motion is described as steady or established. We shall confine ourselves to 
this type of motion. We further assume that there exists a velocity potential, 
i.e. that the projections of vector v(x, y) on the coordinate axes are the partial 
derivatives ðu(x, y)/Ox and Ou(x, y)/Oy of some function u(x, y). The curves of 
the family 

u(z, y)=C (90) 


are described in this case as eguipotential lines. 

The lines, the tangents to which have, at every point, the same direction 
as the vector v(x, y), are called stream lines and give the trajectories of the mov- 
ing particles. We show that the stream lines 
form the orthogonal trajectories of the family 
of equipotential lines. 

Let ¢ be the angle formed by the velocity 
vector v(x, y) with axis OX, where | v | is the 
length of this vector. By hypothesis, du(x,y)/ox 
and u(x, y)/dy are the projections of v(x, y) 
on the axes, i.e. 


Ou (x, y) 

Ou (x, y) $ 

m =]v!-sin ; 
oy | | g 


whence we obtain the expression ‘for the 
slope of the tangent to a stream line as: 


u (x, y) 
Fie. 14 tang = (91) 


The slope of the tangent to an equipotential line (90) is found by differentiating 
this equation with respect to a: 


Ou (x, yY) 
Ou (x, y) ðu (x, y) , SANO _ or 
peek eeg Se coe u (z, y) ’ 
Oy 


i.e. we obtain a slope which is the reciprocal, with reversed sign, of slope (91). 
Hence it follows that the equipotential lines and the stream lines are orthogonal 
to each other. 
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Ifa family of curvesis a family of equipotential lines, its orthogonal trajectories 
form the family of corresponding stream lines, and vice versa. In the case of a 
plane electrostatic field, the lines of force represent the orthogonal trajectories 
of the family of equipotential lines. 


Example. To find the isogonal trajectories of the family 
y = 0x". (92) 
On eliminating C from the equations 
y = Ca"; y’=Cma™?, 
we get the differential equation of family (92): 


tm gy YE 
am 
On substituting this expression for y’ in (89), we get the differential equation 


of the required family: 


rama 

Y T 1l 
yy k?’ 

tme 


the constant tan ọ being written as 1/k, and writing simply y instead of y,. 
This equation reduces to the form: 


y= ——_ 7 (93) 


and is therefore a homogeneous equation [3]. 

If m = 1, (92) is a family of radius vectors passing through the origin, and 
the required curves must cut these at a constant angle, i.e. they are either 
logarithmic spirals [I, 83] or circles. 


Ifm = —1 and k = 0, the problem reduces to finding the orthogonal trajec- 
tories of tho rectangular hyperbolas 
ty =0. (94) 
Here, (93) reduces to the equation with separable variables: 
dy 


Integration again gives a family of rectangular hyperbolas, referred in this 
case to the axes of symmetry: 


x? — y =0. 


As may easily be seen, this family is obtained from the given family (94) by 
turning it through 45° about the origin. In general, for k = 0, (93) reduces 
to the form: 

dy _ x 


dx my 
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and its general solution is: 


my +=, 


i.e. the orthogonal trajectories of family (92) consist, for m>0, of a family of 
similar ellipses, and for m < 0, of a family of similar hyperbolas. The orthogonal 
trajectories of the parabolas y = Cz? are illustrated in Fig. 14. 


§ 2. Differential equations of higher orders; 
systems of equations 


13. General principles, An ordinary differential equation of the 
nth order has the form: 


@ (x, Y, y’, y”, e.. y™) =0, (1) 


or, on solving with respect to y: 
YO = f (8, Y Yr Y”, ve YED). (2) 


Every function y of the independent variable x that satisfies 
equation (1) or (2) is called a solution of the equation, whilst the actual 
task of finding the solutions of the equation is described as the task 
of integrating the equation. We take as an example the linear motion 
of a point-mass of mass m under the action of a force F, which depends 
on time f, on the position of the point and on its velocity. If we take 
as axis OX the straight line along which the point moves, the force F 
can be considered as a given function of ¢, x and dv/dt. By Newton’s 
law, the product of the mass of the particle and its acceleration must 
be equal to the force acting. This gives us the differential equation 
of motion: 

da 


m -jp 


dg 
= F (t, t, $) (3) 
Integration of this second order equation determines the relationship 
between x and ¢, i.e. the motion of the particle under the action 
of the given force. In order to obtain a definite solution of the problem, 
we must also specify the initial conditions of the motion, i.e. the position 
of the particle and its velocity at some initial instant, say at t = 0: 
da ; 


x a to -Fr eT Lay (4) 
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In the case of the nth order equation (1) or (2), the initial conditions 
consist in a specification of the function y and of its derivatives up 
to and including the (n — 1)th order for a given value of x = 24: 


Hime =Y Use Yard WO. =. (5) 


The Yo Yé, ---, yf» here are definitely assigned numbers. 


A uniqueness and existence theorem is valid for the nth order 
equation, as for the first order equation, and can be stated as follows: 
if fla, y, y’, «++, YTP) is a single-valued function of its arguments, is 
continuous for all x in the neighbourhood of x, and for all y, y’, ..., 
y"—» in the neighbourhood of (5), and has continuous first order partial 
derivatives with respect to y, y’, ...,y~, a single definite solution of 
equation (2) corresponds to initial conditions (5). 

On varying the constants Yo, yg, --.; yr) in the initial conditions, 
we obtain an infinite set of solutions, or more accurately, a family 
of solutions, depending on n arbitrary constants. These arbitrary 
constants can appear in the solution, not as initial conditions, but 
in the more general form: 


y= p(z, Ci, Ca. ss On) (6) 


Such a solution of equation (2), containing n arbitrary constants, 
is called the general solution of (2). The equation of the general solution 
can also be written in implicit form: 


p(z, Y, C1, Oa. oe Cn) =0. (7) 


On assigning definite values to constants C,,C,, ...,C,, we obtain 
particular solutions of the equation. 

We obtain n equations by differentiating equation (6) or (7) (n — 1) 
times with respect to x then substituting x = x, and initial con- 
ditions (5). It is assumed that these equations are soluble with respect 


to O1,C,,...,C, for any given initial conditions (£o, Yo, Yo, -- 
y$'-») from a certain interval of variation of £y Yo yo, --- ys >. 


We thus obtain the solution satisfying conditions (5). If the right- 
hand side of equation (2) is a many-valued function, there will be 
several solutions of equation (7) corresponding to initial conditions (5). 
Every solution not included in the family of the general solution, 
i.e. not obtainable from (6) for any values of constants Cs, is called 
a singular solution of the equation. 

The remarks made in [7] in connection with first order equations 
must be borne in mind as regards the concepts of general solution 
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and singular solutions. These concepts have to be related to the 
existence and uniqueness theorem. 

If the right-hand side of equation (2) is expanded into a series 
arranged in positive integral powers of the differences: 


(x ao To), (y Giz Yo) (y’ — Y) vee (y9 — yr), 


on the hypothesis that the absolute values of these differences do not 
exceed a certain positive number, the solution satisfying initial 
conditions (5) can be represented as a series 


yo + 8 (r — a) + (ea)? +... (8) 


for all x sufficiently near x). Here, equation (2) itself gives fully defined 
values of the coefficients of the series, as in the case of first order 
equations [5]. In fact, on substituting x = x, and initial conditions 
(5) in the equation, yo” can be found. We then differentiate (2) with 
respect to x, substitute «= 2, and initial conditions (5) and y™® = 
= y™, and thus find y{"t”, and so on. 

Another procedure can be adopted for finding the coefficients of 
the series, that of replacing y on both sides of equation (2) by the 
power series: 


y=% +É (z — a) + 4 © — ay)? +... + 


Fei i—i (E — To)" + an (a — 2o)" + anpi (E — o) h.n. 


with undetermined coefficients an, an4,,.... We arrange the right- 
hand side of the equation obtained in powers of (x — £o), then suc- 
cessively determine the coefficients just mentioned by equating the 
terms in like powers of (x — x) on both sides of our identity [5]. 


Example. We consider the motion of a particle of mass m along a straight 
line under the action of an elastic force tending to pull the particle back to its 
position of equilibrium and proportional to the displacement of the particle 
from this position. We further assume that the motion takes place in a medium 
whose resistance is expressed as the sum of two terms: the first directly pro- 
portional to the velocity, and the second proportional to the cube of the velo- 
city. If we let x denote tho displacement of the particle from its equilibrium 
position, we get the differential equation: 


ae ew. Z 13 
ma” = — kx — kx’ — kx’, 


where k,, kz, k, are positive coefficients of proportionality. 
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We take a numerical example: 
g” = — g — 0.12’ — 0.1 x3 (9) 
and we look for the solution satisfying the initial conditions: 
Bling = 20 = l; 2’ lemo = 26 = 1, (10) 


as a series arranged in powers of t. We differentiate equation (9) with respect 
to t: 


g” = — gx — 0.12” — 0.302 x” 
a= a m Ole” — 0.3 (8 a + 2a a?) (11) 
sO) =a” — 01a) — 0.3 (6a! a a p at aC) 4 2a) 


a) — gE 01r) — 0.3 (1202 a!” + Gar’ a + Ba’ oo” ol) 4 2 aw, 
We substitute the initial values (10) in equations (9) and (11), and successively 
compute the initial values of the derivatives: 
t=1; só=l; zg =— 1.2; ző = — 0.52; af) = 0.544; 
a) = 0.2160; av) = 3.1453. 
On applying Taylor’s formula, we get an approximate expression x, for the 
required solution:t 


a, = 1+ t — 0.642 — 0.0867: + 0.022744 -+ 0.001815 + 0.004428, 
xi = 1 — 1.2¢ — 0.262 + 0.90728 + 0.000024 + 0.026225, 


wf = — 1.2 — 0.52 + 0.2722 + 0,036: + 0.131144, 


which gives a good degree of accuracy for ¢ near zero. 


14. Graphical methods of integrating second order differential equations. 
There is a corresponding curve for every solution of a differential equation of 
the nth order, and, as in the case of first order equations, we shall call the 
curve an integral curve of the equation. In the case of a first order differential 
equation, there was a corresponding tangent field [5]. 

We now explain the geometrical significance of the second order equation 


y” = fæ, y y’). (12) 
Ţ It is to be noted that we obtain the series for x; and zf, not by differentiating 


the series for %,, but 7 applying Taylor’s formula to x, and x}: 


a’) (vi 
B+ up x = 


aj = af + = Ae a ag 


g T ve” afl’) a) x 


af = ap e e 


vi) 
di t. 
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Let s be the length of arc of the integral curve, and let a be the angle that 
the positive direction of the tangent forms with the positive direction of OX. 
We have [I, 70]: 


dy da 
r tana; er cos a, 
and we obtain, on differentiating with respect to x: 
dy 1 da _ 1 da ds — 1l da 
dz? ~ cosa dz  cos?a ds dz cosa ds ’ 


but da/ds is the curvature of the curve, as we know from [I, 71] 
da 1 


“ae RB” (13) 
and the previous equation gives us: 
1 d? 
E = cos? a- a (14) 


We take R positive here, if a increases with increasing s, and negative if 
a decreases with increasing s. 

We take, say, OX directed to the right, and OY directed upwards (Fig. 15). 
With this, if R > 0, the curve rises from right to left with increasing s 
(counter-clockwise), and in the opposite direction if R <0. 

By (l4), the differential equation (12) can be 
rewritten in the form: 


+ = f(x, y, tan a) cos? a. (15) 
It is clear from this that a differential equation 
of the second order gives the radius of curvature, if 
the position of the point and the direction of the 
tangent at this point are given. 
This fact gives rise to the method of approxi- 
Fig. 15 mating to the integral curve of a second order 
equation by means of a curve with a continuously 
varying tangent and composed of the arcs of circles. 
This method is analogous to that of approximating to the integral curve of a 
first order equation by means of a step-line [5]. 
We take the initial conditions for the required integral curve as: 


Ylxeo = Yos Y lxmo = YO: 

We mark off the point M, with coordinates (£o, Yo) and draw M T , through 
the point with slope y’ = tana = y, (Fig. 16). 

Equation (15) gives us the corresponding R = R,. We draw M,C, perpendi- 
cular to MT, and equal in length to Ry, then with C, as centre construct a 
small arc M,M, of a circle of radius Ry. 

We notice here that the direction of M,C, is determined by the sign of 
R, by what was said above. If, for instance, R, < 0, movement must be clock- 
wise along the arc of the circle from M, to M, (Fig. 16). Let (x,, y,) be the co- 


14] GRAPHICAL METHODS OF INTEGRATING SECOND ORDER EQUATIONS 45 


ordinates of point M, and tan a, be the slope of the tangent M,T, to the circle 
passing through M,. Equation (15) gives the corresponding R = R,. We con- 
struct M,C,, equal in length to R,, and perpendicular to M,T,, i.e. lying along 
the straight line M,C, its direction being determined by the sign of R,; then 
with C, as centre, we draw a small arc M,M, of radius R,. We proceed from M, 
as from M,, i.e. find from (15) the corresponding R = R,, draw the line M,C,, 
equal in length to R, etc. 

A straight rule is used for the above construction, with a hole for a pencil 
at one end. The quantity R is measured off on a graduated scale that runs 


Fie. 16 Fria. 17 


along the rule from this hole. One leg of a small tripod device is located at 
the point corresponding to R, whilst the other two legs are on the paper. On 
shifting the tripod along the scale at points M,, M, etc. in accordance with 
the variation of R, we do not alter the direction of the tangent at these points; 
hence we obtain the required curve. 

We now give another method of graphical integration of equation (12), 
providing an approximation to the integral curve in the form of a step line. 
The method is a generalization of that illustrated in Fig. 9. In addition to y, 
we introduce the unknown function z = y’. We now obtain, in place of the single 
second order equation (12), a system of two first order equations with two 
unknown functions y and z: 

Yes SE = feya) (16) 

We apply the method to be explained in the general case of any two first 
order equations: 


SY = gla, Y: z); Z = f(a, Ys z). (17) 


We take v as abscissa, and y and z as ordinates in the same coordinate 
system, so that there will be two integral curves corresponding to every solu- 
tion of system (17). 
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We mark off the unit length OP along the negative direction of the abscissa 
(Fig. 17). The values of f(x, y, z) and g(x, y, z) are marked off on the axis OY, 
using the scale in which OP is unity; a different scale to that used for the 
functions may be used for x, y and z. 

Let the solution of system (17) be required which satisfies the initial condi- 
tions: 


Ylx=xo = Yo; 
Blesky =, Zo 


We draw a series of straight lines, parallel to the y axis: 


z=% 
v= t; 
L = Nyy... 


We mark off points M, and N, with coordinates (£o, Yo) and (To, Zo). We 
take OA, and OB, along the y axis, equal to 9(%o, Yo Zo) and f(x, Yo, Zo) respec- 
tively. The lines PA, and PB, will have slopes g(x, Yos Zo) and Fl£o Yo Zob 
and will therefore give the directions of the tangents to the required integral 
curves at the initial points M, and N,. 

We now draw from these latter points M,M, and N,N,, parallel to PA, 
and PB,, to their intersections with the line x = x, Let (x, Yı) and (#,, 2) 
be the coordinates of the points of intersection M, and N,. We now mark off 
OA, and OB, on the ordinate axis, equal in length to g(2,, Yı» 21) and f(a, Yy 2). 

From points M, and N, we draw M,M, and N,N;, parallel to PA, and PB,, 
to their intersections with % = x, and so on. We thus obtain two step lines 
M,M,M,... and N,N,N, ..., representing approximations to the required 
integral curves. 

The construction is simplified in the case of system (16), since g(x, y, z) 
coincides with the ordinate z of the second line N,N,N, ... The second line 
here gives an approximate graphical representation of the first derivative y’. 

The construction is greatly simplified if the differential equation has the 
form: 


y” = h (£) + f (Y) + fa (Y°) 


which is often encountered in the investigation of the vibrations of material 
systems with one degree of freedom. 
The equation written is equivalent to the system: 


2 = fı (x) + fa (2) + fa (2). 


If the graphs of the functions f,, fz and f; are drawn with the same ordinate 
scale, we can determine f(z, y, z) by simple addition of the ordinates of these 
three curves for selected corresponding values of the abscissae x, y, and z. 

The method described can also be used for systems of n equations of the first 
order with n unknown functions. We remark that it is sometimes more con- 
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venient to mark off the unit vector which we denoted by OP, as also the values 
of the functions g(x, y, z) and f(x, y, z), from some other point O, of axis OY, 
instead of from the origin O. This is done so as to avoid the lines PA,, PBo, ..-; 
giving the directions of the step line, intersecting with the step line itself. 

Figure 18 illustrates the construction of the solution of equation (9), satisfying 
the initial conditions (10). 


Fra. 18 


15. The equation y = f(x) The equation: 


y = f(z) (18) 
is a direct generalization of the equation y’ = f(x). We start by 


deriving the formula for the general solution of equation (18). Let 
y(x) be any solution of (18), i.e.: 


y (2) = f(a). (19) 
We introduce a new required function z in place of y, given by: 
Y = Yi (2) + z. (20) 


Substitution in (18) gives us the equation for z: 
yy) + zn) = COP 
a= 0. 


or, using identity (19): 
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Since the nth derivative of z must vanish, function z itself is a 
polynomial of degree (n — 1) with arbitrary constant coefficients: 


z= 0 + 0w t... + Ont, 
and (20) gives the general integral of equation (18): 
Y = Y (x) + Ci + Caw t... H On, 
i.e. the general solution of equation (18) is the sum of any particular 
solution of the equation and a polynomial of degree (n — 1) with arbitrary 
coefficients. 


It remains for us to find a particular solution of equation (18). 
We shall seek the solution satisfying the zero initial conditions: 


y has =0; 
y’ leew = 0; | (21) 
Ge a Oe 


On integrating both sides of equation (18) from x, to the variable 
limit v, we obtain: 


x 
yr) — yf = ff (z) de, 
where y$"~» is the value of y"-) for x = ay. 
We have y{"~» = 0 by the last of conditions (21), so that: 


yO = f} (a) de, 


We obtain y$? by again integrating the right-hand side of this 
equation with respect to x between the limits 7) and x, and by proceed- 
ing in this way, we finally obtain the required function after the nth 
integration. We usually write this iterated integration as: 


g £ x a 
y= f de (dx... (dx f f(x) de. (22) 
The n times repeated quadrature can be replaced by a single quadra- 
ture, as we shall now show. 

We expand y(x) by Taylor’s formula, with the integral form of 
remainder term [I, 126]: 


y (2) = yo + (2 — %) + (@— a)? B+... + 


n—1) 
+ (a — not r t eny Seat, 


Te 
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where Yo, Yo, Yor «+ +> y'-? are the values of y and its derivatives 


for 2 = 2, whilst ¢ simply denotes the variable of integration. By 
initial conditions (21): 


Yo =Y =N= -=y 0, 


whilst y(t) = f(t) by differential equation (18); hence Taylor’s 
formula above gives: 


y (0) = map [et fat. (23) 


To 


Formula (23) gives the solution of equation (18) for the zero initial 
conditions (21), or, what comes to the same thing, gives an expression 
for the repeated integral (22) in the form of a single integral. 

We get the general solution of equation (18) by adding a polynomial 
of degree (n — 1) with arbitrary coefficients to solution (23). We notice 
that x appears as the upper limit of integration, as well as under 
the integral sign, on the right-hand side of (23). Integration is carried 
out with respect to ¢, x being meantime considered constant. Formula 
(28) is obviously also valid for n = 1, provided we take 0! = 1. 


16. Bending of a beam. We consider an elastic, prismatic beam, bending 
under the influence of external forces that may be both concentrated anp 
continuously distributed (weight, loading). 

We take OX along the neutral axis of the beam in its undeformed state, and 
OY vertically downwards (Fig. 19). We use the convention that forces acting 
on the beam are positive if directed down- 
wards. We isolate section N of the beam 
with abscissa v. 

Let y denote the displacement of the 
point on the neutral axis, and R the radius 
of curvature of the deformed axis. It is 
shown in the theory of strength of ma- 
terials that, with certain assumption regard- 
ing the character of the deformation and 
the position of the beam relative to axes OX, 
OY, the equation of equilibrium is to be 
obtained as follows: we neglect the part of Fic. 19 
the beam either to the left or to the right of 
N, and calculate the bending moment M(x), 
equal to the sum of the moments about the neutral line of section N of all 
the external forces acting on the neglected part, these moments being reckoned 
positive if, in the case of neglecting the left-hand part, they have a counter- 
clockwise rotation, or in the case of neglecting the right-hand part, they have 
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a clockwise rotation. The differential equation of the bent axis of the beam 


now reads: 

EI 

-p = Me), (24) 
where £ is the modulus of elasticity, and I the moment of inertia of the cross- 
section considered about the neutral line contained in it. 

Taking the deformations as in general small, and the axis of the beam on 

deformation as differing only slightly from axis OX, we can neglect the square 
of the small quantity y’ in the expression for R [I, 71]: 


ty 1 
is y” y’ 


which gives, on substituting from equation (24): 


(25) 


We now suppose that the only concentrated forces are at the ends of the 
beam, being equal respectively to P, and P, (in the case of Fig. 19, P, is nega- 
tive); in addition to these, there are bending couples at the ends, the moments 
of which will be denoted by M , and M,. The distributed loading per unit length 
of the beam is denoted by f(x). 

We find the sum of the moments of the external forces acting on the part 
NL of the beam (Fig. 19). The loading from any element dé with abscissa & 
is f(¢) dé, and its moment about N is 


(E — x) f ($) dé, 
so that the total moment from the full loading of this part is: 


i 
3 (E — a) f (£) dé. 


On adding the moment of the force P;, equal to (l — x) P, and the couple 
of moment M,, we get: 


i 
M(a)= $ (E— 2) f (8) GE + (0 — 2) Pit Mi. (26) 


If we calculated with the above sign convention the sum of the moments 
of all the external forces acting on the part ON of the beam, we should get: 


x 
M(x) = Í (x — £) f (E) d -+ £ P, + Mo. (27) 


It is easily verified directly that both these expressions are equal. In fact, 
the equation 


li x 
S (E — a) f (E) dE + (1 — a) Pi + Mi = f (æ — £) f (E) dE + 2P, + M, 
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reduces to the following: 
t l t 
olf Í (8) dE + P, + + = |. Sf (5) dé + IP, — My + a =0, 
which is in turn an immediate consequence of the equations: 


l 
f EE dE + Po + P= 0, (28) 
0 


l 
{ EE) dé +IP, + Mi — M, = 0. (29) 
0 


The first of these expresses the vanishing of the sum of all the external forces, 
whilst the second equates to zero the sum of the moments about the point 
O of all the external forces acting on the beam, i.e. they simply express the 
conditions of equilibrium. 

On recalling the expression in [15] for an iterated integral in the form of a 
simple integral, we can write, by (27): 


x x 


M (a) = a dx J f(x) da +aP,+M,, (30) 


whence 
aM j 
s e) = AOP B) 
0 
d 
se = f (a). (32) Fia. 20 


The quantity S(x), equal to the sum of all the external forces acting to the 
left of section N, is called the shear at section N. Equation (31) shows that the 
shear is equal to the derivative of the bending moment. 

Equation (32) has the same form as (25), if we replace in the latter the unknown 
function y by M(x) and the right-hand side M(x)/EI by f(x). This substitution 
is of great importance in graphical statics. 


Examples. 1. A beam is constrained at the end O and subjected to a con- 
centrated vertical force P at the end L (Fig. 20); the weight of the beam can 
be neglected. We have in this case: 


f(z) =0; P= P; M,=0; M (x) = (l — x) P,, 


and the equation of equilibrium (25) becomes: 


E fia 
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The sag must be zero at the constrained end « = 0, and the tangent to 
the bending axis must coincide with OX, i.e. we have the initial conditions: 


Y |x-0=0 and y'j|yao=0, 


so that we find [15]: 


ce eae S 
0 


The sag at the end L of the beam is given by: 


PB 
h = y |en = 3h: 


The supporting reaction will operate only at end O. Noting that continuous 
loading is absent here and that M, = O, we have from equations (28) and 
(29): Ry = P, = —P (reaction force); M, = IP, (reac- 
tion couple). 

2. We find the curve of bending of a girder, rest- 
ing on two supports A and B (Fig. 21) and subjected 
to a head of water whose level is opposite the upper 
support (dam). The forces acting on the girder here 
amount to (1) the continuously distributed head of water, 
and (2) the reactions at the supports. 

Let b be the width of the girder and g be the weight 

Fig. 21 per unit volume of water. If we take a strip of the girder 
of breadth dz at a depth x below the level of the water, 
the head of water at the strip is the weight of a column 

of water with its base equal to the base-area of the strip and its height equal 
to the depth of submersion of the strip, i.e. 


e’ b. dx. x = kg de (k = ob). 
Thus we have in this case: f(x) = —ke. 
The problem therefore amounts to investigating the bending of a supported 
beam under the action of continuously distributed loading f(x) = — kx. 


We start by calculating P, and P,, the reactions of the supports. The total 
loading is 


l 
2 
P= f kë ag = tE., 
2 
0 
The reactions at the supports O and L due to the elementary loading kédé 
are, in accordance with the usual law of levers: 


kE lE) Lin 5) dé and HP ag. 
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Hence obviously: 


l 
_ ò[ kēll—8) _kP 1 oe 2 
r= | E a= aap, = P-GP. 
(o) 
We have further, by (26): 


M (z)=— [@-a kde + (= a) P, = 
0 


= -k [€a dt +5 PU a) = (erm). 


Differential equation (25) of bending now becomes: 


—k 
z a 3 
with the obvious conditions: 
Ylxeo= 93 -Ylxay = 0. (34) 
The general solution is: 
— k x5 p x3 
Y= “GET (5 = “p+ 0,2 +0,). 


7 
C, = 0; CQ, = Eg 
whence finally: 
—k 
= —___ (325 — 10/2 4 
y 360EI (3a 102? x3 + 7142). 


To find the position and value of maximum deflection we put x = lt, and 
re-write the above expression for y as: 


y= eae 5 (315 —108+-71) (0<1<)). 


The derivative of the polynomial in brackets: 
15¢4 — 30 + 7 


has only one zero in the interval (0, 1): 


t = f= 2] ~o. 519.. 


which corresponds to a maximum for |y |. 
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Maximum deflection thus occurs towards the end L and not at the centre, 
its value being: 


T _ kB : —k oag 2.348 Pre 
h = Ylxette = -gopr C — 10 + 1) ~ -goar * 2-348 = oar —- 


17. Lowering the order of a differential equation. We notice a 
number of particular cases in which the order of an equation can be 
lowered. 


1. Let the function y and a certain number of consecutive derivatives 
of y: y’, y", ..., yF, be excluded from the equation, which has 
the form: 

D(x, y¥, Yr", y™) = 0. 


We introduce the new variable z = y“, thus lowering the order 
of the equation by k: 
D(x, 2,2’,..., 207%) = 0. 
On finding the general solution of the last equation: 
z = p(x, Cis Ca.. e Crags 
we can find y from the equation: 
y” T g(x, 0;, C2, siy Cn—1)s 


which we discussed in [15]. 
2. If the equation does not contain the independent variable v, 
i.e. has the form: 
Dy, y’,y",...,y) = 0, 
we take y as independent variable and introduce the new function 
p= y. 
If we take p as a function of y, and dependent on x via y, and use 


the rule for differentiation of a function of a function, we get the 
following expressions for the derivatives of y with respect to x: 


and it is clear from these that the order of the equation is (n — 1) 
in the new variables. 
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If the transformed equation is integrated: 


p= HY, Ci» Co, paaa Cn—1) 
the general solution of the given equation can be obtained by a quad- 
rature: 


dy = p dz = (y, Cy, Cos.. -s Cn—1) dz, 


whence: 


dy 
=g + On 
Í p(y, Cis Oze- -3 Cn) tOn 


One of the arbitrary constants, Cn, appears as an addition to g, 
which is equivalent to the fact that any integral curve can be displaced 
parallel to OX. 

3. If the left-hand side of the equation: 


D(z, y,y’,...,y™) =0 


is a homogeneous function [I, 154] of arguments y, y’, ..., y, the 
introduction of a new function u(x) in place of y, given by the formula 
gu dx 
YED 


results in an equation of order. (n — 1) for u. This follows from the 
obvious formulae: 


y’ = e u; y” = eF w’ ur)s.., 


and from the fact that, after substituting in the left-hand side of the 
equation, a certain power of the exponential function written above 
can be taken outside (by the condition of homogeneity) and can then 
be cancelled out. The arbitrary constant of the integration in the 
power of e is an arbitrary factor of y. 


Examples. 1. An equation of the form: 
y” = f(y) (35) 


belongs to case 2. It can also be integrated directly. We multiply both side 
by 2y’ dx = 2dy: 


2y’ y” dæ = 2f(y) dy. 


The left-hand side is obviously the differential of y’?, and integration gives 
us: 


y 
d pa 
y? = fe (y) dy + C, =f, (Y) + C, whence a =Vfi (y) +O; (36) 
Yo 
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We separate the variables and integrate: 


y 
dy 
s46 = f = ; (37) 
AMES 
If the initial conditions are: 


Y |x=xo =Y Y liemixg = yb; 
we obtain, on substituting x = £o, Y = Yo, Y’ = Y in (36) and (37): 
CL=y0 C,=—%, 
and the required solution becomes: 


J 
dy 


f y i 
Yo IR 2f (y) dy + y2 


Let a particle move along the x axis under the action of a force F(x) which 
depends only on the position of the point. The differential equation of motion 
is [13]: 


T — ty = 


d?z 


Let x9, vo be the initial abscissa and initial velocity of the particle at ¢ = 0: 


. dz 
= vo; 


az —— 
t=0 dż 


=v: 
t=o 


If we multiply both sides of the equation by (dv/dt) dt and integrate, we 
get: 


x x 
l fds) 1 l fdr} 1 
zola) = = mop = |F (x) de or =™(G) — [F œ) dx = mo? (38) 
Xe 


The first term on the left-hand side, m(da/dt)?/2, consists of the kinetic 


x 
energy, and the second term — f F (x) a of the potential energy of the 
Xe 


moving particle; and it follows from (38) that the sum of the kinetic and potential 
energies remains constant throughout the motion. We obtain the relationship 
between x and ¢ by solving (38) with respect to dt and integrating. 

2. If the bending of a beam is too large to allow for the second derivative 
y” being taken instead of the curvature [16], we have to take the accurate 
equation (24) instead of the approximate equation (25). Our problem now 
amounts to the following: to find the curve whose curvature is a given function 


of the abscissa, 


+ zsp: (39) 
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This is a second order differential equation: 
EERE 
(+ yh T? O 
On writing p = y’, we get a first order differential equation with variables 
separable: 
d 
P n = P (a) de, 


+ pja 
and integration gives us: 


x 
= [o@ar+c,, 
p S4 


y1 + 

whence 

x 

f p(x) dx + C, 

dy ss 40) 
p dx a x = (2), ( 
1—[ f 9(2)de+C,}? 
Xo 

and finally: 


x 
y =§ v(x) de + Oz. 
Xo 


For the case when the beam is supported rigidly at the end x = 0 and is 
subjected to concentrated loading at the other end x = l, we have [16]: 


(i — x)P 


M(z)=(1—a)P; ple) = “TF = %0 — a) (t= r) 


The equation becomes 


E EREE 
TF yi" 
with the initial conditions: 


2k(l — 2), 


Ylxug == 93 Yx = 0. 


On setting z, = 0 in (40), we must also set C, = 0 by the second initial 
condition, so that we now have: 


2k(l — x) d 
i J RU — 2) de ena 
da x = —(d— 
Jr-tfome—2 ace V ee uF 


a(2l — x) 
V1 — k? x? (2l — 2)? 


(41) 


We find y by integrating again and using the condition y |,_» = 0: 


x(2l — x) 
d 
j J : 
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The integral written is not expressible in terms of elementary functions. 
The curve corresponding to equation (41) is called an elastic curve. 
3. We take the equation: 
a yy” = (y — xy’), 
both sides of which are homogeneous functions of y, y’, y”. On substituting 
y= et dx 
we get: 
a? (u + u?) = (1 — xu)?, 
which gives us the linear equation for u: 
yf 2 1 
ul + Se F 0. 
Integration of this latter gives: 
u =x? (O,+2)=C,a% 4 a1, 


On substituting for u in the expression for y, we get: 
y= eC, x1 + logx +C 

or 
i À y = Cg eC, 
where we have written C, = ef and replaced (—0,) by C,. 

18. Systems of ordinary differential equations. A system o1 n first 
order equations with n unknown functions has the form, on solving 
with respect to the derivatives: 


3 
dy, = filz, Yis Yas. . > Ynh } 
Ta = fə( Yis Bpa )» | ( ) 


Ga Erle, Yr Yo- : Ynt | 


An existence and uniqueness theorem is valid as in the case of a single 
equation: if 
fi l2, Yo Yar Yn) (= 1,2,.. 65) 


are single-valued functions of their arguments, continuous for all x 
near x, and all yı near y$}, and with continuous first order partial 
derivatives with respect to yı there exists one and only one solution of 
system (42), 

Yi = É; (2); 


18] SYSTEMS OF ORDINARY DIFFERENTIAL EQUATIONS 59 
satisfying the initial conditions: 


Yı ls = ys Ye eee = ys 293 Yn [ee = yo . (43) 
We can vary the y{ in the initial conditions, so that the general 
solution of system (42) contains n arbitrary constants. Instead of 
appearing in the solution as initial values yf”, the arbitrary constants 
can also appear in the general form: 


Yi = Yi (z, Ci; Oas.. o Ca) (i= l, 2,...,%). (44) 
We obtain particular solutions of system (42) on assigning definite 
numerical values to the arbitrary constants C,, Oz, ..., Cn. To isolate 


the solution satisfying conditions (43) from this family, we have to 
determine the arbitrary constants from the equations 


Y = yi (2o Oy, Co... Cn) (6=1,2,...,0) (44,) 


and substitute the values obtained in (44). 

On solving equations (44) with respect to the arbitrary constants, 
we obtain formulae which give the general solution of the system in 
the form: 

Pi (Es Yrs Yass Yn) =O; (6 =1,2,..., 2), (45): 


with the essential proviso that these equations are soluble with 
respect to Yp Ya, -+ Yn. Any equation of set (45) is called an integral 
of system (42), and n such integrals have to be found to make up 
the general solution of the system; thus it follows that equations (45) 
must be soluble with respect to Y1, Yo, ---, Yn- 

We can re-write system (42) as a series of proportions: 


dz = dy, = dy, es 
Íi (2, Yo Ya- Yn) fe (2, Yas Yo- Yn) 


= dyn 
aa Ín (2, Yas Yor +++ Yn) 


(46) 


On multiplying all the denominators by the same factor, we get 

a function of variables z, y,, Yz ..-, Yn instead of unity in the denomi- 

nator of the first fraction. If we denote the variables as 2,, £o ..., 

Xn; Zn for the sake of symmetry, the system of differential equations 
(42) can be written in the form 

dz, _ dz, dr, Aën 


x, Xa Ri Mine 


, (47) 


where X,, Xz, ..-,Xn, Xn, are given functions of variables z, 
To +++) Xp, Tny The symmetry of the new form (47) of system (42) 


60 ORDINARY DIFFERENTIAL EQUATIONS [18 


is convenient for later discussions. In particular, which of the (n + 1) 
variables 2, £z, ..-, ny is to be taken as independent variable is no 
longer fixed with (47). The integrals (45) of the system become in 
the new notation: 


Pi (Tis Ta, Zati) = C; (i = l, 2, ...; n). (48) 


When the number of the arbitrary constants that appear in the 
solution (44) is determined, there must be no question of lowering 
this number. For instance, the three arbitrary constants in the 
formulae 


Yı = (Cy + Oa) £ H C3; Yo = Cya?; Ys = 2? + Oat + C, + C, 


can be reduced to two by putting C, + C, = C. The criterion for the 
impossibility of such reduction and for equations (44) giving the 
general solution of the system, consists in our being able to satisfy 
any initial conditions by suitable choice of arbitrary constants, 
i.e. in that system (44,) is soluble with respect to Oi, Oz ..., On 
for any choice of the initial values yf’ of the required functions. We 
assume here that the right-hand sides of equations (42) satisfy the 
conditions mentioned above. 

We now turn to a more detailed consideration of the integrals of 
the system. Suppose that we have k integrals of system (47): 


Pi (ti Tg e; Tny) = C(t = 1, 2, eee k). (49) 


The functions p(x, 2a -.-, Zn41) themselves, and not the equations 
(49), are sometimes referred to as integrals of the system, i.e. a 
function (x1, Ty ---, n4) i8 called an integral of the system if it becomes 
a constant on substituting in it any solution of the system. Of course 
it is assumed here that (2%, ta .--,%n4,) is not itself a constant. 
Since we can have what initial conditions we please for the solution, 
the values of this constant can be taken arbitrarily. If we make up 
an arbitrary function F(9,, Pp» ---, x) of the left-hand sides of 
equations (49), substitution of any solution of the system will make 
all the g;, and therefore the new function, constant, i.e. in addition 
to integrals (49) we have the integral 


F (9, Par sees ey) =C, (50) 


where F is an arbitrary function of its arguments. In other words: 
an arbitrary function of any integrals of the system is also an integral 
of the system. Equation (50) is not a new integral, being a consequence 
of integrals (49). 
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Suppose we have n integrals (48) of system (47). They are said to 
be independent, if equations (48) can be solved with respect to any n 
of the variables q, 2, ..-, %n4,. Such a solution gives us n functions 
of a single independent variable, i.e. formulae similar to (44), the 
formulae being solved with respect to the arbitrary constants in the 
form (48), i.e. n independent integrals (48) of the system are equivalent 
to the general solution of the system. It can be shown that the con- 
dition for the integrals (48) to be independent is equivalent to there 
being no one integral which is a consequence of the rest in the sense 
indicated above, or that there exists between the left-hand sides of 
equations (48) no relationship of the form 


D(H, Por e. Pn) = 0, 


which is an identity with respect to 2, ty ..., n4t 


We have given no test in the above by which we might judge whether integrals 
(48) are independent. Take the case n = 2: 


Pi (E Lo La) = O43 Py (Wy, Lo L3) = Cy. (51) 


If we recall the implicit function theorem of [I, 159], we can say that a 
sufficient condition for equations (51) to be soluble with respect to x, and 
x; is that the expression 


ay, p Op, 99. 
Ayy xs (Pi 2) IF ax, an, A an, an, 


should differ from zero. Similar statements apply as regards x, 2, and 2, 72. 
Assuming that p, and Ø, and their first order derivatives are continuous, it 
can be shown that the necessary and sufficient condition for the independence 
of integrals (51) is that at least one of the expressions 


Ay,» xs (Po Pz) Ayy x1 (Po Pab Axy xe (Po Pz) 


should not be identically zero. We discuss in Volume III the question of the 
independence of a system of functions with any number of variables. 


19. Examples. 1. We take the system: 


dz dy dz 
az ye PY) pa 
On cancelling out 1/z in the equation 
dz dy 
az yz 


we obtain an equation with separable variables, integration of which gives: 


log z=logy — 0 ie log = 0, 
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which is equivalent to 


y 
=o. 


We take as the second equation of the system 
NPs oe SE E 
wz) — (2? +y?) 
and use the solution obtained to substitute in it y = C, x. We have on cancel- 
ling out l/æ: 


da dz 


EETHEN i.e. (1 +02) xdg + zdz =0. 
1 


Integration gives: 


(1 + C2) a? +22 = 0; 
or, on substituting O, = y/x: 
a? + y? + 2%==0,. 


which is the second solution of the system. 
The two solutions of the system are therefore: 


L =C; attyt+e—Cy. (53) 


2. The system of differential equations of motion of a material particle of 

mass m under the action of a given force has the form: 
dx d2y d2z 

where X, Y, Z, the projections of the force on the coordinate axes, are dependent 
on time, the position of the particle, and its velocity, i.e. on the variables 
t, £, Y, Z, @’, Y’, 2’. 

On taking the derivatives x’, y’,z’ of x, y, z with respect to t as the unknowns, 
system (54) leads to the system of six first order equations: 


de 
dt 


dy ie 


Ay i, Fn gy Oe dy’ dz’ 
d 4} ae? di a 


z 
=X; m- Y; m- = Z. 


a 


= 23 


The general solution of this system contains six arbitrary constants, the 
determination of which requires the position of the particle and its velocity 
to be specified at the initial instant. 

The following three equations follow from equations (54): 


2 2 
(yi z e) =z zY 


dt? di? 
d?r d?z 
m(: Se -2-Gr] = 2X —aZ 


d?y ata 
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which can obviously be written in the form: 


zf dz -2-$L) = yZ 2Y 


ae "Ue dt 

d dz dz 

a mfl rx 55 
a m(: J 2] zX — zZ (55) 
d dy daz \ _ 

qrhe var) =k a. 


Let the force be centralized, i.e. always pass through some fixed point, 
called the centre, which we take as origin. Since the projections of a vector 
are proportional to its direction cosines, and the vector in the present case 
passes through the origin and the point (x, y, z), we have 


the right-hand sides of equations (55) now vanish, and we have the three integrals 
of system (54): 
dz dy) a. dz dz) n. 
m(y -p -=s m(2 GP -r-i =C 
(56) 


dy = 
m: (= ar Uae )=es 


They express the fact, familiar in mechanics, that the areal velocity of the 
projections of the moving particle on the coordinate planes is constant. 
It follows from equations (56) that 


Ox + Cy +0,2 =0, 


whence it is evident that the trajectory is a plane curve. The plane of the 
trajectory is obviously determined by the centre of the force and by the velocity 
vector at the initial instant. 

Now let X, Y, Z be partial derivatives of some function U, depending on 
x, y, z. We call U the potential of the force, whilst (— U) is the potential energy 
of the particle: 


ou au au 


aS da’ oy ’ Oz 


If we multiply the equations 
da OU d?y oU d?z ðU 


m ——-— = 


dë d? de Oy? aie = Se 
by da/dt, dy/dt, dz/dt and add, we get: 


dz da dy dty dz dz) dU 
Oar a a ae ar ae ar 


or 


ae Car) + (at) + Ge) ]-4r- 
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whence we obtain the integral 


P= 7 =0, (57) 


o m dv \? dy \? dz\?7]_ 1 
Dis [a +a) + (a7) os 
is the kinetic energy of the particle. 
Equation (57) expresses the fact that the sum of the kinetic energy T and 
the potential energy (—U) is constant throughout the time of the motion. 
3. We consider a system of n particles, inter-related in such a way that the 


coordinates of any given particle are defined as functions of the independent 
parameters qı, Q,---,Q%, and of time t: 


where 


Ti = Pi (Gry Gare got) Yi = Vi (I Iv- lwt) Zi = Oi (Jo Tore + Met) 
(¢= 1, 2,..., 2). (58) 
Let the system be acted on by forces of potential U, depending only on the 
position of the particles; then the projections X;, Y;, Z;, of the forces acting 
on the ith particle on the coordinate axes, are the partial derivatives of U 
with respect to %; Yip Z; Let the masses of the particles be M, Mo, ... Mp- 
By using equations (58), we can write the kinetic energy: 
n 

Se S| (eee (ey Ea 

aa 2 ( at) + dt + dt 
and the function U, in terms of parameters q,, q2, .--,%,, the motion of the 


system being then defined, as is well known from mechanics, by the following 
Lagrangian equations: 


d (ƏT oT aU 
ar (ag) qs gs ee oa 
The function T is clearly a second degree polynomial in the derivatives 
qi da» +++» 9k Of the parameters with respect to time, and (59) consists of k 


second order equations, which is equivalent to 2k first order equations; inte- 
gration of equations (59) gives us expressions for the q, as functions of ¢ and 
of 2k arbitrary constants. 

Let us suppose that equations (58) do not contain ¢. Then ż¢ will also be absent 
from T and U. We multiply equations (59) by lA qy» .- -, qk respectively and 
add: 


k d ( er k eT dU 
4 é = ; 60 
PA Is “ay a) 2% GIA dt (60) 


We notice the obvious equality: 


k a (2) E er a & ar 
Sa [aS otis am r 
2s a g 2% dq; ae 2 as 


19] EXAMPLES 65 


In the present case, T' is a homogeneous polynomial in q; and 
k , 3T 
$a = 2T, 
Ži x 09s 


by Euler’s theorem regarding homogeneous functions [I, 154]. Hence 


Saud OP) Sg OF ar aT _ aT 
» 6 ar (eg As iz di di at’ 
and (60) gives us: 

aT _ daU 

dt dt’ 


whence the integral of (59) is obtained (total energy integral): 
T—U=0. 


4. Knowledge of the integral of the differential equations of motion of a 
system sometimes enables us to solve the problem of the stability of small 
oscillations of the system about the position of equilibrium. We state the 
problem mathematically whilst simplifying the discussion by confining ourselves 
to the case of three unknowns 2, y, z, which satisfy the system of differential 
equations:f 


dæ  ,, dy _ „dz 
dt = dt = dt SA (61) 


where X, Y, Z are known functions of x, y, z, and t, vanishing for 
v =y =z=0. (62) 


With this, system (61) has the obvious solution (62), which corresponds to 
the position of equilibrium. The position of equilibrium [or simply solution 
(62)] is said to be stable if, for any given positive g, there exists an 7 such that, 
for any solution of system (61) satisfying the initial conditions: 


Tlia = Xs Ylteo =Y Zlino = Zo 


we have 
lx], lyi, and |z| <e, (63) 


provided only that 
l£ob lyol and |zo] <n- (64) 
Let system (61) have an integral 
plz, y, z) =0, (65) 


not containing t, and such that the function p(x, y, z) has a maximum or a 
minimum for x = y = z = 0. We show that, with this, the position of equi- 


t There are six unknowns in the case of the motion of a single material 
particle. 
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librium is stable. By changing the sign of p if necessary, we can assume that it 
has a minimum; and by adding a constant to gy, we can assume that the minimum 
is zero, 

Function y now vanishes at the point x = y = z = 0 and is positive at all 
points (x, y, z) near to, but not at, (0,0, 0). We take a cube ô, with centre 
at the origin and with a side of length 2e. The continuous function p is positive 
at the surface of the cube and therefore attains a least positive value m, so 
that over all the surface 


g>m>o. (66) 


We now take a concentric cube by about the origin, with length of side 
27, such that the inequality is valid within the cube 


p<m, (67) 


which is possible since (0, 0, 0) = 0. Let the point (x, y, z) be situated inside 
cube 6, at the initial instant, i.e. condition (64) is fulfilled. Inequality (67) 
will be valid not only at the initial instant, but throughout the time of the 
motion. By (65), in fact, p preserves the constant value C during the motion. 
But given this fact, point (x, y, 2) cannot cross the boundary of cube 6, during 
the motion, since inequality (66) must apply at the boundary, which contra- 
dicts (67); condition (63) must thus be satisfied for all ¢ > 0, which is what 
we required to prove. 

The unknowns 2, y, z can be any geometrical or mechanical values, and 
we only considered them as the coordinates of a point for the sake of clarity 
of proof. Suppose, for instance, that T' and U in equations (59) do not contain 
time ż, so that the total energy integral is valid. Let the equations apply for 
q; = 0 (s = 1,2, ..., k): 


ðU ðU aU 0 
Og, Bg Ne 
Equations (59) now have the evident solution: 
9s = 9s = 0, (68) 


to which the position of equilibrium of the system corresponds. If it also happens 
that the potential energy (— U) is a minimum for the g, = 0, we can assert 
that (T — U) is also a minimum for values (68), since T, which cannot be 
negative, now vanishes, i.e. is also a minimum. Hence we see that the position 
of equilibrium corresponding to minimum potential energy is stable with respect 
to the g, and q; (Lagrange—Dirichlet theorem). 


20. Systems of equations and equations of higher orders. We con- 
sider the relationship between a system of first order differential 
equations and a single higher order equation. If we have, for example, 
one differential equation of the third order: 


y” = f(x,y, yy"), 
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we can replace it on writing y= Y Y’ = Yp Y” = Ya, by a system 
of three equations of the first order: 


dy, ats 


daz = = Ya; a Y3; 


= f(x, Yi Yo, Ya). 


We have already carried out a similar substitution in [14]. Likewise, 
if we are given a system of two second order equations, for instance: 


y” =f, (x, Y Y’, 2, z’); 2” = fa (8, Y, Y’, 2, z’), 
where y and z are required as functions of x, we can replace this by a 
system of four first order equations; here we introduce the four 
required functions: y = Y; Y = Yz Z = Yz Z = Ya 
The first system above can be written in the form: 


d 

LA = Ya; = fı (£, Yn Ya Ya Ya): 
d 

is = Ya; = fe (£, Yi Yas Ys Ya) 


Conversely, we show that integration of a system can in general 
lead to integration of a single higher order equation. We shall only 
consider the case of a system of three first order equations solved 
with respect to the derivatives: 


= fi (£, Yo Yo Yah Y= falt, Y Yor Ya); 
(69) 
Y3 = fa (£, Yr, Yz Ys)- 


Let the first equation contain y, and let us solve for this: 


Y: = 0 (2, Yi Yi Ys). (70) 


On substituting in the remaining two equations of the system 
we shall obtain equations of the form: 


a a bo y 


öy I 


oo i 


WTS -ys + = Fa (£, Yo Yo Ys); 


Ys = Va (£, Yr Yi Ys). 


On substituting for yg from the second equation in the first, then 
solving the first equation for yj, we get a system of two equations 
with two required functions y, and y}, of the form: 


i= P(T, Yo Yi Ya)3 Y= VD Yr Yr Y). (71) 
Let the first equation contain y,; we solve for this and get 


Ys = 3 (X, Yr, Yr Yi) (72) 


68 ORDINARY DIFFERENTIAL EQUATIONS [21 


on substituting in the second of equations (71), we obtain a third 
order equation in y,, which may be written as: 


yy F (a, Yi Yi yi). (73) 
Suppose that we have managed to integrate this equation: 
Yi = D(x, Ci» C, Cs). 


We obtain y, on substituting in equation (72); if we then sub- 
stitute in (70), we obtain y,, without further integration. If the first 
of equations (71) does not contain y,, we already have a second order 
equation for y,, and its general solution will contain two arbitrary 
constants. On substituting this general solution in the second of 
equations (71), we get a first order equation for y,, and integration 
of this introduces a third arbitrary constant. Finally, y, is determined 
from (70) without further integration. 


21. Linear partial differential equations. We have so far considered 
differential equations containing derivatives of functions of a single 
independent variable. As already mentioned, such equations are 
called ordinary differential equations. We now consider a class of 
partial differential equations which is directly related to the theory 
of systems of ordinary differential equations. 

We return to the system of differential equations (47): 


dz, _ dz, 


=...=>5>—- 74 
xX, xX, Xma ( ) 
An equation 
P (Lis Los -oes Tny) =C 
or a function 9(2,, %, ..., ny), not identically constant, is called an 


integral of system (74) if, on substituting in it any solution of the 
system obtained in accordance with the existence and uniqueness 
theorem, we obtain a constant. 

Thus, let x, be the independent variable, and £}, 73, ..., %n4, be 
functions of x, representing a solution of system (74). A constant 
must be obtained on substituting these functions in the expression 
P(t Voy +++) Tny), ie. the independent variable must go out as a 
result of substitution; hence the total differential with respect to 2, 
must be equal to zero [I, 69]: 
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or 
an, th 4 oe e de, +... +5 a 7 Ont = =0. (75) 


But the differentials is. must be ae to the Xs, since we 
have substituted a solution of system (74); hence we obtain the 
following equation for p, on replacing the dz, in (75) by the pro- 
portional X,: 


dy Op ap 
Zig, + Kage, tes + Xan ge = 9 (76) 


The function (%, £o ...,;%4,) must satisfy this equation inde- 
pendently of the precise nature of the solution of system (74) that 
we have substituted in the function. If we take all the solutions of 
(74), we can give variables 2,, 7, ..., %n4, whatever values we please, 
in view of the arbitrariness of the initial conditions in the existence 
and uniqueness theorem, i.e. 9(%,, %, .--,; Xn4,) must satisfy equation 
(76) as an identity in (%4, £o ..., %n4,). Hence we obtain the following 
theorem. 

THEOREM. If (£i £o ---, Una) =C is an integral of system (74), 
the function (£, £a -.-,%n4,) must satisfy the partial differential 
equation (76). 

The converse is easily proved. 

THEOREM. If (2, £o ..., 241) ts any solution of equation (76), 
Plts La . ++, Iny) =C is an integral of system (74). 

We need only substitute any solution of system (74) in (2, 2%, ..., 
ny) and take the total differential: 


dp (4, Tey vee, Ln4i) = ny dt + ae dr, +.. + a dtn. 


Since a solution has been substituted, the dx, can be replaced by 
the proportional X, by (74), i.e. we write dz, = à X, where 2 is a 
coefficient of proportionality. Hence: 


3 ə 
dg (x, Tip soie n41) =4(X, x + X, a +o. t Xnt Geer): 


But by hypothesis, y satisfies equation (76) identically in z4, £a ..., 
Ln S0 that d¢(a,, %, ...,%n4,) = 0. The expression for the first 
order differential is independent of whether the variables are inde- 
pendent or not [I, 153]. In the present case, after substituting a 
solution of the system, pọ will be a function of a single independent 
variable, say æ; the differential of the function » was equal to zero, 
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i.e. the derivative with respect to x, (after substitution) is identically 
zero, so that y no longer depends on q, i.e. is constant. It follows 
from this that ọ is an integral of the system, which is what we required 
to prove. 

The two theorems just proved establish the equivalence of an 
integral of system (74) and a solution of the partial differential equa- 
tion (76). If 


Pı = Cys Pa = Og; «63 Pk = Cy 


are k integrals of the system, the arbitrary function F(p, Pa, -.-, Px) 
is also an integral of the system, as we have seen, and we can therefore 
assert that an arbitrary function of any solutions of equation (76) is 
also a solution of the equation. If 


Pı (ro La, s Troi) = Cy; 605 Pnl Da -.., Tn41) = Cp (77) 


are n independent integrals of system (74), the arbitrary function 
F (1 Po +++) Pn) is a solution of equation (76). 

This can be verified directly by substituting p = F(9,, Po ..-; Gn) 
in equation (76) and noting the fact that functions 9, Pa ..-, Øn 
satisfy the equation. We do not dwell on the proof of the fact that 
this is the general solution of equation (76). The following rule is 
obtained for integrating (76): to find the general solution of the linear 
partial differential equation (76), we must form the corresponding 
system of ordinary differential equations (74) then obtain n independent 
integrals (77) for the system; the general solution of (76) is then 


P= F (Gy, Po <., Pn)s 


where F is an arbitrary function of its n arguments. 

A linear partial differential equation of the form (76) has two 
characteristics: its coefficients X; do not contain the required function 
ọ and its free term is zero. The general case of a linear equation has 
the form 


ag ay oy =, 
Y Go, tirga Teet Engs Fats =9, (78) 


where Y,, Yo, ..., Yn4, contain £, £z ...,%, and p. We seek the 
family of solutions of equation (78) as the implicit function 


© (14, Tg ++ Tp VY) =C, (79) 
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where C is an arbitrary constant. By the rule for differentiation of 
implicit functions: 


Ow 
dy ri, 
a Do’ 

“Op 


on substituting in (78), we get the equation for w: 
ð ð 
Yra t Yig Poo Yng Yng =O (9) 


which has the two characteristics indicated above. We note that the 
variables 21, 2%, ..., &n, p can have any values in view of the arbitra- 
riness of C in (79,), and hence it follows, as above, that equation (79,) 
must be satisfied identically with respect to %,%, ..., Zn @. Its 
solution leads to integration of the corresponding system of ordinary 
equations. Having found œ, (79,) gives us g. It can be shown that, 
given certain general assumptions regarding the Yx, all the solutions 
of equation (78) can be found in this way. 

We notice that the general solution of a partial differential equation 
contains an arbitrary function, whilst only arbitrary constants appear 
in the general solutions of ordinary differential equations. 

We consider linear partial differential equations in more detail in 
Volume IV, andestablish the corresponding existence and uniqueness 
theorem. 


22, Geometrical interpretation. We give a geometrical interpretation 
of the above theory in the case of three variables. Suppose we have a 
tangent field in three-dimensional space, i.e. a direction is defined 
for each point of the space. On taking any system of rectilinear 
axes, every direction (or tangent) is defined by three numbers, 
proportional to the direction cosines of the tangent, i.e. the cosines 
of the angles formed by the tangent with the coordinate axes. Generally 
speaking, we have different tangents at different points, and the 
complete tangent field is defined by three functions: 


u(x, Y, 2), v(%, Y, z), Wa, Y, 2), (80) 


such that the direction cosines of the tangent at a given point (2, y, z) 
are proportional to magnitudes (80). 

We consider the same problem as in the case of a first order equation, 
that of finding the curves in space whose tangents are those defined 
by the field. We know from [I, 160] that the direction cosines of a 


72 ORDINARY DIFFERENTIAL EQUATIONS [22 


tangent are proportional to dz, dy, dz, whilst if two directions coin- 
cide, quantities proportional to their direction cosines must themselves 
be proportional, i.e. we have the following system of differential 
equations for obtaining the required curves in space: 


dx _ dy dz 
u(x, Y, z) v(@, y, z) wa, y, z)’ 


(81) 


Integration of this system amounts to finding its two independent 
integrals: 


Pı (x, yY, z= Ci; Pa (£, Y, 2) = C , (82) 


ie. such that equations (82) are soluble with respect to any two 
variables. These two equations define a certain curve in space [I, 160]; 
we obtain a family of integral curves of system (81) on assigning 
various numerical values to C, and C,. Initial conditions amount to 
specifying that the required curve should pass through a given point 
(£o Yo Zo). Lhe arbitrary constants C,, C, are determined by the initial 
conditions. 

We now turn to the geometrical interpretation of the linear partial 
differential equation. We again take functions (80) as defining a 
certain tangent field, as above. It is required to find a surface such 
that, given any point of it, the corresponding tangent plane contains 
the direction defined by the field at the point. Let the equation of a 
family of the required surfaces be: 


g(x, y, z2)=C. 


From [I, 160], the direction cosines of normals to these surfaces are 
proportional to dy/dx, dg/dy, Əp/Əz, whilst the direction of the normal 
must be perpendicular to the direction defined by magnitudes (80), in 
order that this latter may lie in the tangent plane. We apply the usual 
condition for two lines to be perpendicular [I, 160], and obtain a 
linear partial differential equation for determining ¢: 


3 0 0 
ule, y, 2) Ze + O(a, y 2) Ge + (a, Y,2) B= 0. (83) 


The system of ordinary differential equations corresponding to this 
last equation is (81), so that the general solution of (83) has the form: 
p = F (pi, pa), 


whilst the general equation of the required surfaces is 


F (Pi; P2) = 9, (84) 
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where F is an arbitrary function of its arguments. We do not need to 
write an arbitrary constant C in view of the arbitrariness of func- 
tion F, whilst gy, and p, are the two independent integrals (82) of 
system (81). If we make a definite choice of function F, surface (84) 
will evidently be the locus of the integral curves of system (81) on 
which the values of the constants in equations (82) are connected by 
the relationship: 

F(C, Ca) = 0. (85) 


The solution of equation (83) is generally speaking made precise 
if we stipulate that the required surface should pass through a given 
curve in space (L). The stipulation represents initial conditions for 
partial differential equation (83). The required surface will evidently 
be composed of the integral curves of system (81) which start from 
points of the curve (LZ), i.e. the initial conditions of which are deter- 
mined by the coordinates of points of (L). We obtain a definite surface 
in this way, in view of the existence and uniqueness theorem for 
system (81). This excludes the case when (L) is itself an integral 
curve of system (81), when the above procedure leads us to (L) itself 
and not to a surface. 

It can be shown that in general an infinite set of surfaces g = 0 
passes through the curve (L), where ¢ satisfies equation (83). A de- 
tailed discussion will be found in Volume IV. 

Let the equation of (L) be given as a set of two equations: 


Yı (T, yY,z) = 0; Pe (x, Y, 2) = 0. (86) 


If we eliminate variables x, y, z from the four equations (82) and 
(86), we obtain a relationship between C, and C, which, by (85), 
also determines the form that function F must take in order that equa- 
tion (84) may give the required surface passing through curve (86). 


23. Examples. 1. We consider the partial differential equation: 


id 3p Sa ec ae 
az -oy tyg e ty’) -az = 0. (87) 
The corresponding system of ordinary differential equations is: 
da dy dz 
ae ye ety as 
We found its two independent integrals in [19] above: 
a Cy w+ y?+227=C,. (89) 


g£ 
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The first equation gives a family of planes passing through the z axis, whilst 
the second gives a family of spheres with centres at the origin. The integral 
curves of system (88) will be a family of circles lying in these planes with their 
centres at the origin. The general solution of equation (87) is 


p= F(L, waves). 00 


where F is an arbitrary function of its two arguments. Let us find the form 
of F such that the surface 


F(t, at fy? +2!) =0 (91) 
passes through the straight line 


v=]; y=z. (92) 


We eliminate x, y and z from equations (89) and (92). The first of equations 
(89) gives, together with (92): 


s=]; y=0; 2= 0; 


substitution in the second of equations (89) now gives the relationship between 
C, and 0;: 


14203 -—C,=0, ie. F(C,,C,) =1+4 20} —C,. 


With this form of function F, (91) becomes the equation of the required 
surface: 


142% — (att yttet)=0 or at 2yt— ater t yt tt) <0. 


2. Let the tangent field defined by a system of differential equations be 
such that its direction is the same at all points of space. Let (a, b, c) be numbers 
proportional to the directioncosines of this fixed direction. The system of 
differential equations will be: 

dg dy dz 


= — se or cdx —adz==0; cdy—bdz=0, 
a b c 


which yields at once the two integrals: 
cx —az=0,; cy— bz =O). 


The integral curves are obviously parallel straight lines with the fixed direc- 
tion referred to. The corresponding partial differential equation 


Op Op Op 
a +o- +e 0 (93) 


0z 


defines the surfaces (x, y, z) = 0, representing the locus of certain of these 
straight lines, i.e. (93) is the equation of certain cylindrical surfaces. Its general 
solution has the form: 


g = F(cx — az, cy — bz), 
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where F is an arbitrary function, and the general equation of the cylindrical 
surfaces whose generators have the fixed direction is 


F(cx — az, cy — bz) = 0. 


3. Let the tangent field be such that its direction at any given point 
M(æ, y, z) coincides with the direction of the radius vector from a fixed point 
A(a, b, c) to the point M(x, y, z). The projections of the vector on the coordinate 
axes are 


x—a, y—b, z-e¢ 
and these quantities are therefore proportional to the direction cosines of the 
given direction at M. The corresponding system of differential equations is 


dx _—s dy dz 
Z-a@ y—b z=c 


and we have the two obvious integrals: 


It is geometrically obvious that the family of straight lines passing through 
A(a, b, c) is a family of integral curves. The corresponding partial differential 
equation 

op 


Op op, = 
egy Pe) a Pe ee 8 


defines conical surfaces with vertex at A, the general equation of these surfaces 
being 
r(==% a 
z—c z—e 


where F is an arbitrary function of its two arguments. 

We remark that generally only one conical surface can be drawn through 
a given curve in space (L), generated by the radius vectors from the point A 
to points of (L). If, however, (L) is one of the integral curves of the system, 
i.o. is a straight line passing through the point A, an infinite set of conical 
surfaces can be drawn to contain (L). 

4. We take another system of differential equations of the form: 

da dy dz 


cy—bz  az—ca  bæ— ay ` (94) 


On equating all three ratios to the differential dt of a new variable t, we can 
write: 

dx = (cy — bz) dt; dy = (az — cx) dt; dz = (bx — ay) dt. (95) 
Hence two equations are easily obtained, integrable directly. The first equation 
is obtained by multiplying equations (95) respectively by a, b, c and adding, 
whilst for the second we multiply equations (95) by 2, y, z respectively then 
add. This gives us the two equations: 

adx + bdy + cdz =0, vds + ydy + zdz =0, 
integration of which yields the two integrals of the system: 
ax + by + ez =C; -+y +2? = C, (96) 
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The first integral gives a family of parallel planes, the direction cosines 
of the normals to which are proportional to the numbers (a, b, c). The second 
integral gives a family of spheres with centres at the origin. The intersections 
of these planes and spheres represent the family of integral curves of system 
(94), which evidently consists of circles lying on the planes and with centres 


on the straight line 
x 


a 


Yy 
ig eet (97) 
which in turn passes through the origin and is perpendicular to all the planes. 
It can easily be shown that the aes partial differential equation 


ə 0 
(cy — bz) a + (az — cz) 2 rF (bu — ay) a =0 


defines the surfaces of revolution which have (97) as axis of revolution, the 
general equation of these surfaces being 


F(ax -+ by + cz, x? +y? +z?) =0, 


where F is an arbitrary function of its two arguments. We remark that the 
form of the denominators in system (97) could be obtained from geometrical 
considerations by suitably specifying the tangent field as was done in pre- 
vious examples. 

5. The problem of orthogonal trajectories in space leads to a linear partial 
differential equation. Suppose we are given a family of surfaces 


w (x,y,z) =O, (98) 


dependent on the parameter C, so that, in general, one and only one surface 
of the family passes through every point in space. We require to find the 


surface 
g(x, Y, z) = Ci» (99) 


which intersects all the surfaces (98) at right angles. The condition that the 
normals to surfaces (99) and (98) should be perpendicular gives us a linear partial 
differential equation for the required function 9: 


ðw Op Ow Op 3w 0p 


“Ox. Oa T By ay + z “Oz, ae 


The corresponding system of ordinary equations: 


da dy dz 
do dw dw (100) 
Oa Oy dz 


defines the curves, whose tangents at every point are normal to surfaces (98) 
passing through the point concerned. If 


Pı (£, Y, 2) = C1; He (x, Y, 2) = Cy 


are two independent integrals of system (100), the equation of the required 
surfaces has the form: 


F(p, pa) = 9. 


CHAPTER II 


LINEAR DIFFERENTIAL EQUATIONS. 
SUPPLEMENTARY REMARKS ON THE THEORY 
OF DIFFERENTIAL EQUATIONS 


§ 3. General theory; 


equations with constant coefficients 


24. Linear homogeneous equations of the second order. The simplest 
part of the theory of differential equations is that dealing with linear 
equations; these have received the most detailed treatment and are 
the most commonly encountered in applications. We dealt with the 
solution of linear equations of the first order in [4]. We consider 
linear equations of any order in the present chapter, starting with 
those of the second order. 

An equation of the form 


Py)=y" + plz)y' + a(x) y = 0, (1) 


is called a linear homogeneous equation of the second order, where 
the left-hand side is denoted by P(y) for brevity. 

It follows from the linearity of P(y) with respect to the function y 
and its derivatives that, given arbitrary constants C, C, and O, 


P(Cy) = CP(y)s P(Ciy, + Cz ye) = Cy Ply,) + Cz P(Y). 


If y = y, is a solution of the equation, P(y,) = 0, and obviously 
P(Cy,) = 0, so that y = Cy, is also a solution. Similarly, if y, and y, 
are solutions, 


y = Ci Yı + Cry, (2) 


is also a solution, with arbitrary constants C,, C,. Thus, further solutions 
of the linear homogeneous equation (1) can be obtained by multiplying 


G7 
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existing solutions by arbitrary constants and adding. It is obvious that: 
linear homogeneous equations of any order will possess the same 
property. When we refer below to a solution of equation (1), it will 
be assumed to differ from the trivial solution y = 0. 

The existence and uniqueness theorem can be stated very simply 
for equation (1), as we prove in a later paragraph: if the functions 
p(x) and q(x) are continuous in the interval a < x < b, and if 2, is 
any x belonging to the interval, there exists one and only one solution 
of equation (1) satisfying the initial conditions 


Yl eax = Yo Of ae = Yor 


where Ya and yo are any given numbers. This solution exists throughout 
the interval a < x <b. 

We shall in future consider the solutions of equation (1) with æ 
varying in the interval of continuity of p(x) and q(x). In view of the 
arbitrariness of £o, Yọ Yo in the existence and uniqueness theorem, 
equation (1) has no singular solutions. 

Two solutions y, and y, of equation (1) are said to be linearly 
independent if no identity with respect to x exists of the form 


a Yı + 2Y, = 0, (3) 


where a, and a, are non-zero constant coefficients. In other words, 
the linear independence of y, and y, implies that the ratio y,/y, is 
not a constant, i.e. that the derivative of the ratio 


d a Yr Y2 — Yo Yi 
— — pT 4 
dx a Yi (4) 


is not identically zero. 
We introduce into the discussion the expression 


AY; Yo) = Y1 Y2 — Y2Yv (5) 


called the Wronskian of the solutions y, and y} A characteristic of 
the Wronskian is that: 


Alyy, Ya) = Ape H?O™, (6) 
where 4, is a constant, equal to the value of A(y;, Y2) at x = Zp. 


We prove this by finding the derivative: 


dA > , + a £. td n a” a 
AA w yy + yh — LY — Ye Y= YE — Yat 
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Since y, and y, are solutions of equation (1), we can write: 


yi + plz) yi + gl£) yı =0; yz + p£) yz + gl£) Yz = 0. 
We multiply the first equation by (—y,) and the second by yı 
and add: 
Yi Y2 — Y2 Y1 + P(Z) (Y Y2 — Yai) = 0 
so that 
dA (yy 
Luh 4 pia) A (E Yo) = 0. (7) 


This is a linear homogeneous equation in 4, and we obtain (6) at 
once on applying (31,) of [4]. 

It follows from this formula that A(y,, Yı) is either identically zero, 
if the constant A, is zero, or is non-zero for all values of x, since the 
exponential function does not vanish. We assume here that p(z) is 
continuous. 

By (6), we can write instead of (4): 


— F p(x) dx 
© % 


r Ye H) = Alyo Y) 

de \ y, yi en 

and hence it follows that two solutions y, and y, of equation (1) are 
linearly independent when, and only when, Aly, Ya) differs from zero, 
i.e. when A, ~ 0. 

We now show that, if y, and y, are linearly independent solutions 
of equation (1), (2) gives us, with suitable choice of constants C, 
and C,, the solution of (1) satisfying any previously assigned initial 
conditions: 

Ylx=xo = Yo gprs Yo- (8) 

Let Y10 Y20 Yio Y29 denote the values of y, and y, and their derivatives 
for x = xo. To satisfy the initial conditions (8), we have to determine 
the C, and C, in (2) from the system of equations 


CiYio + C2Yz0 =Y Ci Yio + C2 Y20 = Yo- 
It follows from the linear independence of y,, Y, that 


Ay = Yio Y20 — Yoo Yio ¥ O, 
so that the system written gives us fully defined values for C, and C,, 
which proves our assertion. 
But by the existence and uniqueness theorem [3], every solution 
of equation (1) is fully defined by its initial conditions, and we can 
therefore state the following proposition: if y, and y, are two linearly 
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independent solutions of equation (1), all the solutions of the equation 
are given by (2). 

The problem of integrating (1) thus reduces to finding two linearly 
independent solutions. Let y, be a solution, and y, any other solution. 
We get by integrating relationship (7): 


-5P dx q - {Pw dx 
dz 
i (9) 


= Afe A or n=4y |e T 


thus, if a particular solution of equation (1) is known, its second 
solution can be found by using (9), where 4, is a constant which can 
be set equal to unity. 

It must be remarked that it proves impossible to find this solution 
explicitly, or even with the aid of a quadrature, in the general 
case when p(x) and g(x) are functions of x. We shall see, however, 
that the solutions are obtainable explicitly in some particular 
cases, including that when p(x) and q(x) are constants, and not 
functions of z. 

We also give later a method of constructing solutions which is 
often used in applications, viz., the construction as an infinite series. 


25. Non-homogeneous linear equations of the second order. An 
equation of the form: 


u” + pla) wu’ + g(a) u = f(x). (10) 
is called a non-homogeneous linear equation of the second order. 
If p(x), q(x) and f(x) are continuous in an interval a < x < b, 
we have, as will be shown later, exactly the same existence and 
uniqueness theorem as for the homogeneous equation (1). Below, 
we shall consider the solutions of equation (10) in the interval of 
continuity of p(x), g(x) and f(x). 
Let u = u, be a solution of the equation, so that: 


ui + p(z) u; + g(x) u, = f(x). (11) 
On introducing a new function y instead of u: 
u = y + u. (12) 


and substituting in (10), we get: 


[y" + play’ +(e) y] + [ui + p (2) u + g (a) u] =F (2), 
or, by (11), 
y” + play +a(z)y=9. (13) 
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This last equation is called the homogeneous equation corresponding 
to equation (10). If y, and y, are two linearly independent solutions 
of (13), we have, by (12) and the proposition of the previous article, 
the formula: 


= O19, + Cz Y2 + t, 


where C, and C, are arbitrary constants, giving all the solutions of 
equation (10). The property can be stated thus: the general solution 
of a non-homogeneous linear equation of the second order is equal to 
the sum of the general solution of the corresponding homogeneous equation 
and any solution of the non-homogeneous equation. 

The above proof is obviously also applicable to non-homogeneous 
linear equations of any order, so that these possess the same property. 

The knowledge of two linearly independent solutions of the homo- 
geneous equation (13) enables us, as we shall now see, to find a parti- 
cular solution of equation (10), and hence its general solution. We use 
here the method known as Lagrange’s method of varying the arbitrary 
constants [4]. 

Let y, and y be two linearly independent solutions of (13). The 
general solution is expressed by (2), as we know. 

We shall seek a solution of (10) in the same form, except for taking 
C, and C, as required functions of x instead of as constants: 


u = V (£) Yı + Va (T) Yo. (14) 


Since we have two required functions, and not just one, we can 
subject v(x) and v(x) to a further condition, apart from (10). We lay 
down the following condition: 


vi (z) Yı + v2 (2) Y = 0. (15) 
On differentiating (14) and using (15), we obtain: 
q (x) |u = v; (£) Yı + Va (T) Yz 
p(z) jw = A 
1+ {u” = 0 (x) Yi + Va (2) Yz + vi (x) yi + va (z) Yz. 
We substitute in the left-hand side of (10), and get: 


(x) [yi + p(x) yi + 9 (a) y] + v(x) [yz + p(x) yz + a (2) y] + 
+ vi (z) yi + v (x) Yz = f(x). 
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Bearing in mind that y, and y, are solutions of homogeneous 
equation (13), and recalling (15), we have the system of equations 


vi (©) Yı + Vz (2) Yo = 0; vi (z) yi + 02 (2) Yz = f (2) (16) 
for determining vi(x) and v4(z). 
By the linear independence of solutions y, and y, 
A (Yi Yo) =Y 92 — Yi #O, 
so that system (16) fully defines vi(x) and vz(x). We find v(x) and 


v(x) by carrying out the integrations, then substitute in (14) and 
obtain the solution of equation (10). 


26. Linear equations of higher orders. Higher order linear equations 
possess many of the properties of second order equations. We state 
these without dwelling on their proof. 

An equation of the form 


YP + pi (x) yd + py (x) yO) + Ppa (a) y’ + 


+ Pn(X)y = 0. (17) 
is called a linear homogeneous equation of the n-th order. 
If Yis Ya, --+, Yk are solutions, the sum 
C19 + OY te. + On 
where Oi, O» ...,C, are arbitrary constants, is also a solution. The 


proof of this is exactly as for second order equations [24]. 
The statement of the existence and uniqueness theorem is also as 
for second order equations, with the initial conditions taking the form: 


y has = Yo; y pE a Yo; sees yr) ey = yf. 


Solutions 4,, Yə» +++» Yk are said to be linearly independent if there 
exists no identity in æ of the form: 


a Yı + Og Yq +... + OY, = 0 


with constant coefficients a,, Ag, ...,a@,, not all of which are zero. 
If Yi, Yo, +++» Yn are n linearly independent solutions of the equation, 
all the solutions are given by the formula: 


Y = Ci Yı F CY t+... +Cnyn, (18) 


where the C; are arbitrary constants. The solution satisfying the initial 
conditions given above can be obtained by suitable choice of the C;. 
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A non-homogeneous linear differential equation of the n-th order has 
the form: 
u™® +p, (2) u-d + py (x) u-d +... + 
+ Pn (2) W + pn (2) u = f (2). (19) 
If u, is a solution of this equation, and y,, Yz ---, Yn are linearly 
independent solutions of the corresponding homogeneous equation 
(17), the general solution of (19) is given by 
= O19, + Coe +--+ OnYn t+ W, 


where the C; are arbitrary constants. 
With this, if Y}, Ya» .--, Yn are known, the solution of (19) can be 
found from the expression: 


u = Vy (£) Y, + Ug (L)Yy +.. -H Vrt) Yn, 


where the v'i(x) are determined from the system of first degree equa- 
tions: 
Vi (£) Yı + Va (L) Ya +... + Va (2) Yn = 0 
vi (2) yi + 02 (2) yz +.. -+ On (2) Ym = 0 
KED + of a) 9-2 4... + oh (a) yh'-2 —0 
vj (æ) YY + 0; (@) YL +... + vn (2) YY = f (2). 
It may be pointed out, to the reader familiar with determinants, that the 
necessary and sufficient condition for linear independence is precisely analogous 
to that given above for second order equations. Let y,, Yz, ...,Y, be solutions 


of (17) as before. The following nth order determinant is called the Wronskian 
of these solutions: 


Yi Yrs Yn 
F Yi Yh Yn 
(Uo Yme W= lyf ye, wR 


YED, YD... yg- 


and an expression can be derived for this, analogous to (6): 


z 
Alun tay Yq) = dp TAO, 
where A, is the value of A at x = x,. It follows from this expression, as above, 
that 4 is either identically zero, or does not vanish for any value of x. A neces- 
sary and sufficient condition for the linear independence of solutions y,, Yz 
.. Yk is that the Wronskian should not be identically zero. With this, the 
arbitrary constants of (18) are fully defined for any initial conditions. As in 
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the case of second order equations, the existence and uniqueness theorem 
gives the solution throughout the interval of continuity of the coefficients 
p(z), P(x), ..., P(x) of the equation. 


27. Homogeneous equations of the second order with constant 
coefficients. Before dealing with equations with constant coefficients, 
we prove a formula of the differential calculus which will be required 
later. We are familiar with the formula for the derivative of the 
function e™, where r is a real number: 


(e) = re™, 


We prove that the same formula applies when r is complex and x 
is the usual real variable, i.e. 


(e(a+bx)’ pae (a 4 bi) e(atbhx, 


It follows from the definition of an exponential function with complex 
exponent [I, 176] that: 


e(a+b)x — e%(cos ba + isin bx). 
We get by differentiation in accordance with the usual rules: 
(e(@+5)x)’ — ae®™ (cos bx + isin bz) + be™ (— sin ba + i cos ba), 
or, on taking į outside the second bracket and noting that 1/i = —i, 
(e@+b)*)’ — ae (cos bx + isin bx) + bie (cos ba + isin bz) = 
= (a + bi) e% (cos ba + isin bx) = (a + bi) e@tFdx, 


which is what we wished to prove. 
We now turn to the solution of a linear homogeneous equation 
of the second order with constant coefficients: 


y” + py’ + ay = 0, (20) 
where p and q are given numbers. We substitute a function of the 
form e™ for y in the equation, where r is a real or complex number 
which we require to find: 


y=el™, . (21) 
We get by differentiating and taking e™ outside the bracket: 
e™ (r? + pr + g) = 0, 
so that (20) will be satisfied if r is a root of the quadratic equation: 
r+or+tgq=0, (22) 
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this latter being called the characteristic equation of equation (20). 
If the quadratic equation has two distinct roots, r = 7, and r = T» 
(21) gives us two linearly independent solutions of the equation: 


=e, Ya = e", (23) 


This follows easily from the fact that the ratio e: e" = e&-™* 
is not a constant. We now take the case when (22) has equal roots, 
ie. when p? — 4g = 0, the single root of the equation being given 
here by: 

n=nr=— 2. (24) 


Since the method described has led us now only to the one solution 
y, =e’, the other solution remains to be found; this is done by 
applying the following argument. 

We slightly alter the coefficients p and q so that the roots become 
distinct; for instance, we may let the root 7, keep its former value 
(24), whilst the root r, is made slightly different. With this, two 
solutions (23) are obtained. We subtract these two solutions and 
divide by the constant (7, — 7,), which again gives us a solution [24]: 


efx — efx 


We now let the altered coefficients p and g tend to their original 
values, for which equation (22) had a double root. With this, r, tends 
to r, and both numerator and denominator tend to zero in (25), 
so that the fraction as a whole tends to a limit equal to the derivative 
of e™ with respect to r at r = r,. The second solution of the equation 
is thus y, = xe™*”. Hence, in the case of equal roots of equation (22), 
we have the following two linearly independent solutions: 


Y =e; Yo = ven”, (26) 


We can verify by direct substitution that y, is in fact a solution 
of the equation. The left-hand side of (20) becomes: 


(rj ren 4- 2r, en”) + p(r, gen” + e) + que = 
= gen" (ri + pr, + 9) + e™* (2r, + p). 


The first term on the right is zero, since r == 7, is a root of (22), whilst 
the second term is zero by (24); hence y, is a solution of equation (20). 

We consider the coefficients p and q as real numbers. But the 
roots obtained on solving the quadratic equation (22) may be either 
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real or complex. If (22) has real and distinct roots, (23) gives two 
linearly independent real solutions, and the general solution of the 
equation is: 
y = CO," +- Cy 6", (27) 

Let equation (22) have complex roots; they must be complex con- 
jugates [I, 189], ie. 7, = a+ fi and r, =a — fi, and (23) gives 
the solutions: 

y, = e@*FOx — e°% (cos Bx + isin Ba); 

Yo = e@—Fd* — e° (cos Ba — i sin Bx). 


We obtain further solutions by taking the linear combinations of 
these solutions: 


1 1 P 
z (Yı + Y2) = e™ cos px, PA (Yı — Ye) = e% sin Bu. 


These two solutions are also linearly independent, so that in the case 
when equation (22) has complex roots r= a + fi, the general 
solution of the equation is: 


y = e™ (C, cos Bx + O, sin Bz), (28 
Finally, if (22) has equal roots, the general solution is, by (26): 
y = (Ci + 0z a) e™. (29) 


We also note the particular case of (28), when equation (22) has pure 
imaginary roots, i.e. when a = 0. We must have p = 0 here, whilst 
q must be a positive number. If we write q = k’, we have roots + ki 
for (22), and the equation 


y" + hy =0 (30) 
therefore has the general solution: 


y = C, cos kx + C, sin kg. (31) 


28. Non-homogeneous linear equations of the second order with 
constant coefficients. We now take the non-homogeneous equation 


y” + py’ + gy = Ka), (32) 


where p and q are given real numbers as before and f(x) is a given 
function of x. To find the general solution of this equation, it is sufficient 
to find any particular solution and add this to the general solution 
of the corresponding homogeneous equation (20). Since the general 
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solution of the homogeneous equation is known, the particular 
solution can be found with the aid of a quadrature by using the 
method of variation of the arbitrary constants [25]. Let us take as 
an example an equation of the form: 


y” + by = f(a). (33) 


The general solution of the corresponding homogeneous equation 
is given by (31), and we must seek the particular solution of equation 
(33) in the form: 


u = V (x) cos kæ + vy (x) sin kx, (34) 


where v,(x) and v(x) are required functions of x. Equations (16) now 
lead to a system of two linear equations for the derivatives of these 
functions: 
vi (x) cos ka + v, (x) sin kæ = 0 
— vi (x) sin ka + v, (x) cos kæ = -i fa). 


Solving these gives us: 
v (4) = — - te) sin kx; vj (x) = Ha) cos kx. 


We write the primitives as integrals with variable upper limits and 
with the variable of integration denoted by &: 


v(e) = ~— = f f(g) sin kë dë; 0, (ax) = f f(E) cos kë dë, 


where 2, is a fixed number. Substitution in (34) gives us the particular 
solution: 


ieee — Ske (Hg) sin ké dé + mH f 18) cos kë dẹ (34,) 


or, on taking under the integral sign the factors independent of the 
variable of integration: 


u= = f(E) sin k (x — £) dé, (34a) 


Te 


and the general solution of (33) becomes: 


y = C, cos ka + Cy sin kæ +- | AE) sin k(w —ê) dé. 
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We notice one point in connection with (34,). The variable x has a 
double role on the right-hand side of this expression. Firstly, æ is 
the upper limit of the integral, and secondly, it appears as an additional 
parameter (and not as variable of integration) under the integral 
sign, being reckoned constant whilst the integration is carried out. 
It is easily shown that the particular solution (34,) satisfies zero 
initial conditions at 2 = 2, i.e. 


Urzo =0, | eaoxe = 0. (343) 


The first of these equalities follows directly from (34,), since the 
upper and lower limits of integration coincide at x = £, and the 
integral vanishes. The second equality is obtained by finding u’ from 
(34,), bearing in mind that the derivative of an integral with respect 
to its upper limit is equal to the integrand at the upper limit. We 
get after obvious cancelling: 


uw’ = sin kx Fr) sin kë dé +. cos ka fr) cos ké dé, 
z; Žo 
whence the second equality of (34,) follows at once. 


29. Particular cases. With special forms of the right-hand side of equation (32), 
the particular solutions can be found much more simply, without recourse 
to the method of variation of the arbitrary constants. We start by proving a 
lemma. Let the right-hand side of (32) be the sum of two terms: 


y + py’ + ay = f(x) + fe (2), (35) 


and let u,(x), u(x) be particular solutions of the non-homogeneous equations 
whose right-hand sides are respectively f(x) and f,(2), i.e. 


uj + pui + qu, = fh (£); U3 + pus + gua = fa (x). 
We obtain on adding: 
(thy + Ua)” -+ (tly + Ue)’ + (ty + ua) = fy (£) + fe (2), 


so that (u, + u,) is a particular solution of equation (35). 
Let us now take a non-homogeneous equation of the form: 


y” + py’ + qy = aeks, (36) 


where a and k on the right-hand side are given numbers. We make use in future 
of an abbreviated notation for the left-hand side of equation (22), writing 


pr) = 7? + pr +q- (37) 


We shall seek a solution of (36) in the same form as its right-hand side, i.e. 
in the form: 


y =a, ek, 
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where a, is a required numerical coefficient. On substituting this in (36) and 
cancelling e, we get an equation for a, which, by (37), can be written in the 
form: 

p(k) a, = a. 


If k is not a root of (22), i.e. p(k) Æ 0, this last equation gives us a,. Now let 
k be a simple root of (22), so that p(k) = 0 but p’(k) # 0 [I. 186]. We shall 
seek the solution of (33) here in the form: 


y = a, xekx, 
We obtain on substituting in the equation and cancelling e“: 
p(k) a,x +g (k) a, =a, 


or, since o(k) = 0, 
p (k) @a =g, 


whence we find a, since p’(k) Æ 0. Finally, if k is a double root of (22), so that 
p(k) = g'(k) = 0, it is easily shown, as above, that the solution of the equation 
is to be sought in the form: 


y = a, z? kx, 


The same method can be used for finding the solution in the more general 
case, when the right-hand side has the form P(x) e, where P(x) is a polynomial 
in x. If k is not a root of equation (22), the solution is to be sought in the form: 


y = P; (x) ex, (38) 


where P,(x) is a polynomial of the same degree as P(x) and the coefficients 
of P,(x) are required to be found. On substituting (38) in the equation, cancelling 
out e, and equating coefficients of like powers of x, we obtain equations 
for the coefficients of P (x). 

In the case of k being a root of equation (22), the right-hand side of (38) 
has to be multiplied by the factor x or x?, depending on whether k is a simple 
or a double root of (22). 

We now turn to the case of a right-hand side containing trigonometric func- 
tions. Let us take the equation, to start with: 


y” + py’ + qy = e& (a cos læ + b sin læ). (39) 
By using the expressions [I, 177]: 
lxi —lxi ixi _ aixi 
cos ls = ti ni sin le = Pe, 
2 2i 


we can put the right-hand side of equation (39) in the form: 
Actt4)* 4 Belk-l)x, 


where A and B are constants. If the complex numbers (k + li) are not roots of 
equation (22), the solutions must, in accordance with the above, be sought in 
the form: 

y= Ayo tx 4 B e60, 
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or, as can be seen on returning from exponential to trigonometric functions via 
the formula 
etlxi == cos læ + i sin lx, 


the solution of (39) is to be sought in the form: 
y = of (a, cos lx + b, sin lz), (40) 


where a, and 6, are required constants. Similarly, it can be shown that the right- 
hand side of (40) must be multiplied by x if (k + li) are roots of (22). The 
constants q}, b, are obtained by substitution of expression (40) in equation (39). 
We remark that, if say only cos lx appears in the right-hand side of (39), we 
still have to take both the coslx and the sin lx terms in the solution (40). 

We note a more general result, without dwelling on its proof. If the right- 
hand side has the form: 


e [P (x) cos lx + Q(z) sin Ie], 


where P(x) and Q(x) are polynomials in w, the solution must be sought in the 
same form, 
ek [P2 (x) cos le + Q, (x) sin la], 


where P,(x) and Q,(x) are polynomials in x of degree equal to the greater of 
the degrees of P(x) and Q(x). If (k + i) are simple roots of equation (22), a 
factor x must be written in front of the solution. 


30. Linear equations of higher orders with constant coefficients. 
We state without proof in the present section properties of higher 
order equations analogous to the above. Later on, we explain the 
general theory of linear equations with constant coefficients by 
using a special method, known as the method of symbolic factors. The 
present properties will then be proved. 

A homogeneous equation of the nth order has the form: 


yg + py t+... + Pray’ + Pry = 9, (41) 


where 7,, P» --., Pn are given real numbers. We write down the 
characteristic equation, analogous to equation (22): 


ret prt tt py iy P+ Py = O. (42) 


For every simple real root of this equation, r = 7,, there isa correspond- 
ing solution y =e”. If the root has multiplicity s, the following s 
solutions will correspond to it: 


e*, we, ..., aS tel, 
The solutions 


e°% cos Bx and e™ sin Ba. 
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correspond to a complex conjugate pair of simple roots r = a + fi. 
If the roots are not simple but have multiplicity s, the following 2s 
solutions correspond to these: 


e% cos Ba, xe™ cos Bx, ..., 25-1 e°% cos Bx 
e™ sin Bz, xe% sin Bx, ..., 2° e™ sin px. 
In this way, all the roots of equation (42) lead us to solutions of 


equation (41). On multiplying these solutions by arbitrary constants 
and adding, we get the general solution of the equation. 


To discover a particular solution of the non-homogeneous equation: 


yP + ayy) +o. Pr Y + Pay =f (2) 


we make use of the method of variation of the arbitrary constants [26]. 

If the right-hand side has the form P(x) e, where P(x) is a polynomial and 
k is not a root of equation (42), the solution of the equation can be sought 
in the form y = P,(z) e, where P,(x) is a polynomial of the same degree as 
P(x). If k is a root of (42) of multiplicity s, we have to put y = 2° P,(x) eM: 
If the right-hand side has the form 


f(x) =e [P (x) cos læ + Q (x) sin læ}, | (43) 


and (k + li) are not roots of equation (42), the solution is to be sought in the 
same form: 


y = o" [P: (x) cos læ + Q, (x) sin lz], 


where the degrees of polynomials P(x) and Q,(x) must be taken equal to the 
greater degree p of polynomials P(x) and Q(x). 
On the other hand, if (k + li) are roots of (42) of multiplicity s, the factor 
x* must be written in front of the right-hand side of the last formula. 
Examples. 1. We take the equation 


y” — by’ + by = 4 sin 2x. 
The corresponding characteristic equation 
r?—Sr+6=0 


has roots r, = 2 and r, = 3. The general solution of the homogeneous equation 
becomes 


Cy e™ + 0,0. _ (44) 
The particular solution of the equation is to be sought in the form: 
y = a, cos 2x + b, sin 2x. 
We get on substituting in the equation: 
(2a, — 10b,) cos 2x + (16a, — 4b,) sin 2x = 4 sin 2z , 


which gives 
2a,—10b,=0; 16a, — 4b, = 4, 
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whence a, = 6/19 and b, = 1/19, ie. the particular solution is: 


5 1. 
y = -jg Cos 2x + -g sin 2x. 


We obtain the general solution of the equation on adding this to (44). 
2. We take the fourth order equation: 


yf — 2y” + 2y” — 2y’ + y =x sin g. 
The corresponding characteristic equation 
rt — 273 4. 2r? — 2r 4-1 = 0 


can be put in the form: 


(2-41) (r— 1)? 0 


which has a double root r, = r, = 1 and a pair of imaginary conjugates r,,r,= 
= +12. The general solution of the homogeneous equation becomes: 


(0, + 0,2) e” -+ 0, cos x +0, sin x. (45) 


On comparing the right-hand side with (43), we see that here k = 0, l = 1, 
p = l, and k 4 li = +i are simple roots of the characteristic equation, so 
that the particular solution must be sought in the form 


y = 7 [ (ax + b) cos a + (cx + d) sin z] = (ax? + ba) cos x + (cx? + dx) sin 2, 


where we require to find the coefficients a, b, c, d. 


31. Linear equations and oscillatory phenomena. We indicate the 
importance of linear equations of the second order with constant. 
coefficients in the study of oscillatory phenomena 
----4--4.-. We denote the independent variable by ¢ (time) and 
the function by 2; this notation will often be used 

in future. 
oe Ik We consider a body of mass m suspended from a 
spring and oscillating vertically about its position of 

Fie. 22 equilibrium, where the weight of the body is exactly 

balanced by the elastic force of the spring. 

Let x be the vertical distance of the body from the equilibrium 
position (Fig. 22). Suppose that the motion takes place in a medium 
whose resistance is proportional to the velocity dx/dé. 

The following forces will act on the body: (1) the restoring force 
of the spring, tending to return the body to the equilibrium position, 
which we shall take as proportional to the displacement x of the body 
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from the equilibrium position, and (2) the resistive force, proportional 
to the velocity and acting in the opposite direction. The differential 
equation of motion becomes: 
d?r dz 
an =bg 


dz da 
cx, or m- +b-7- + cw = 0. 


We consider as a second example the motion of a simple pendulum 
of length l in a medium of resistance proportional to the velocity. 
The differential equation of motion becomes, as is familiar from 


mechanics: 
d2 0 


ml aT 


= — mg sino —b, (46) 


where 0 is the angular displacement of the pendulum from the 
equilibrium position. If the oscillations of the pendulum about the 
equilibrium position are small, we can take the angle 0 for sin 9, 
and equation (46) reduces to: 


d?6 GU) 
If there is an additional external force, depending on time, acting 
on the pendulum, we get a non-homogeneous equation instead of (47): 


d? 6 dé as 
The motion in both the above cases is defined by a linear differential 
equation of the second order with constant coefficients. 
We shall write this equation in future in the form: 


So + th + Me=0 (49) 
or 
d. 
SE + ah- + be = fit). (50) 


We generally arrive at such an equation when considering the 
small oscillations of a system with one degree of freedom about its 
position of equilibrium. The term 2h da/dt comes from the resistance 
of the medium or from friction, k being called the coefficient of 
resistance; the term k? x comes from the internal forces of the system, 
tending to return it to the equilibrium position, k? being referred to 
as the coefficient of restoration; and the term f(t) in equation 
(50) is due to the external disturbing forces that act on the system. 
An equation of the type written is encountered, not only in the study 
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of the oscillations of mechanical systems, but also in other problems 
of oscillatory phenomena in physics. We take as an example the 
discharge of a condenser of capacity C through a circuit with resistance 
R and inductance L. If v is the voltage across the plates of the con- 
denser, we have for the circuit 

di 


o=Ri+L ait 


(51) 


where 7 is the current in the circuit. In addition, the relationship 
is known to hold: 


EE (52) 


Let there also be a source of electromotive force # in the circuit, 
which we shall take to be positive if it acts in opposition to the 
direction of i. We have in this case instead of (51): 


On substituting (52) in the equation written, we get the differential 
equation: 


LoS. + ROS? +0=E 
or . 
dtu R adv v E 
“ae ToD a o = to" (53) 


On comparing this equation with equation (50), we see that the 
term (R/L) dv/dt is analogous to the term due to resistance, the term 
v/LC is analogous to that due to the restoring forces, whilst the 
term E/LC corresponds with the term from the disturbing force. 

If we find v from equation (53), we can also find 4, by substituting 
in (52). 


32. Free and forced oscillations. We consider the homogeneous 


equation 
xz” + 2hx' + k?x=0, | (54) 


corresponding to the case when no external forces are present. The 
solution of this equation gives the free, or proper, vibrations. The 
corresponding characteristic equation will be: 


T2 + 2hr + k? = 0. (55) 


We split up the further discussion into separate cases. 
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1, Damped vibrations. In most cases the coefficient of resistance 
h is fairly small compared with the coefficient of restoration k?, 
so that (h? — k?) is negative: k? — k? = —p?. Equation (55) has in 
this case conjugate imaginary roots: 7,,r, = —h + pi, and we have 
for the general solution of (54): 


x =e" (C, cos pt + C, sin pt). (56) 
On setting 
C, = Asing; C, = Á cosg, (57) 


solution (56) can be written in the form 
x = Ae™™ sin (pt + 9), (58) 


or, if we write p = 27jt, 
x= Ae" sin (= +- p). (59) 


Here, t is the period of free vibration, A is the initial amplitude, 
and o is the initial phase. If the resistance of the medium is neglected, 
i.e. we put h = 0, the roots of equation (55) are r = +ki, and we 
obtain in place of (58): 


x= Asin (kt + p). (60) 


This gives us a pure harmonic oscillation of period t = 2z/k. 
Formula (59) represents damped oscillations [I, 59], the speed of 
damping being characterized by the factor e~™. In an interval of 
time equal to the period, the amplitude decreases in the ratio e~™. 
The values of the constants C, and C, in (56), or what amounts to 
the same thing, constants A and 9 in (58), depend on the initial 
conditions. Suppose the initial conditions are: 


Lipa = To; X|p_o = To: (61) 


On substituting t = 0 in (56), we get C, = 2. We now differentiate 
(56) with respect to t: 


x’ = — he" (C, cos pt +- Cz sin pt) + pe” (— C sin pt + O, cos pt), 
whence we find, on substituting t = 0: 


Ca = xg + ha, 


cte, (62) 
and the solution satisfying initial conditions (61) is finally: 
z = et (x cos pt ane. sin pt) (63) 
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We notice that the coefficient of damping h and the frequency of 
oscillation p = //k? —h? in solution (63) are completely defined by 
the coefficients in equation (54). As regards the amplitude A and 
initial phase p, which are dependent on the initial conditions, we can 
write, by (57): 


xo + hay 


Asing =x; A cosg = 7 


2 


from which A and y may be determined. If h = 0, p must be replaced 
throughout by k. 
2. Aperiodic motion. If (A2 — k?) is positive: 


ht — e= g, 
the roots of (55) will be: 
r= —ht+q m= — hgq, (64) 
and we have [27]: 
z = OC, e07! 4 O, emrt, . (65) 
Since we obviously have here q < h, both roots of (64) are negative, 


and x therefore tends to zero on indefinite increase of t. 
We differentiate equation (65) with respect to t: 


x’ = C, (q — h) e= — Cz (g + h) e0, (66) 
On putting t = 0 in (65) and (66), we get two equations for C, and C, 
in terms of the initial conditions (61): 

O, +02 = zo; (g — h) 0 — (4 + h) C: = To 
whence 


_ (g+h)ro+tzó , _ (g—h) x — 2% 
= thats o= Wen 


3. Special case of aperiodic motion. If, finally, h? — k? = 0, equation 
(55) has a double root 7, = 7, = —h, and we get [27]: 


æ =e (0, + Cyt). (67) 


Since te~" tends to zero on indefinite increase of t [I, 66], expression 
(67) also tends to zero. 
The non-homogeneous equation 


x” + Qha’ + ha = fit), (68) 
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in which the right-hand side f(t) is due to external forces, defines 
forced vibrations. In the case corresponding to pure harmonic free 
vibrations, we have: 

a” + Wa = f(t) (69) 
and the general solution here is [28]: 


t 
z = O, cos kt + O, sin kt + -7 [ Hu) sin k(t — u) du, 
$ 


where the last term on the right gives the pure forced vibrations, 
i.e. the solution ef equation (69) satisfying the zero initial conditions: 


Timo = X lio = 0. (70) 
It can be shown, by using the method of variation of the arbitrary 
constants, that in the case when the free vibrations are damped, the 
particular solution of (68) satisfying initial conditions (70) is 


t 
zo (t) = = e—ht f ohu f(u) sin p(t — u) du, (71) 
0 


In the aperiodic case, the particular solution becomes: 


t t 
x (t) = =e tfer-o 4 f(u) du — S e(t | efarma f(u)du. (72) 
0 0 


We leave the proofs to the reader. 


33. Sinusoidal external forces and resonance. In practice, the right-hand side 
is often found to be sinusoidal: 


x” + 2ha’ + k? æ = Hsin (wt + po). (73) 


We shall seek the solution here in the form of a sinusoidal quantity, of the 
same frequency w as the right-hand side [29]: 


a= N sin (wt + po + ôo) (74) 


We need to define the amplitude N and phase displacement of this vibration. 
We substitute expression (74) in equation (73): 


— w? N sin (wt + p, + 8) + 2hw N cos (wt + pa + ô) + 
+k? N sin (wt + py + 6) = H, sin (wt + ¢). 


We write the argument of the trigonometric functions on the left of this 
last equation as the sum of the two terms (wt + 9,) and ô. We now use the 
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formulae for the sine and cosine of a sum, and obtain: 
[(k2 — wœ?) N cos ô — 2hwN sin 6] sin (wt + p) + 
+ [2h@N cos 6 + (k? — w?) N sin ô] cos (wt + 9) = H, sin (wt + 9,). 
On equating the coefficient of sin (wt + p,) to the constant H,, and the 
coefficient of cos (wt -+ pọ) to zero, we get two equations for N and ô: 
(k? — wœ?) N cos 6 — 2hoN sind = H,; 2hwN cosé + (k? — œ?) N sind = 0. 
We solve these with respect to cos ô and sin 6: 


(k? — o?) H, 


= 2hoH, 
NI — w2)? + 4h ot]? 


~ N[(k? — w2)? + 4h? o2] 


cos 0 = sin ô = 


Squaring both sides of each and adding gives: 
Ho K 
N? [(k? — w?)? + 4h? œ?] ; 


whence we find 
H, ? 
Viki ot)? aho | 


(78) 


On substituting this value for N in the above expressions for sin ô and cos 0, 

we obtain the formulae for 06: 
: k? — w? e 2hw 

cos ô = —— —; sind= 22 ee 

V (k? — 02)? + 4h? w V (k? — w2)? + 4h? w 


(76) 


Having found N and 6, (74) now gives the sinusoidal part of the solution 
of equation (73), whilst its general solution becomes 


x= Ae Mt sin (pt + p) + N sin (wt + po + ô), (77) 


where A and ¢ are arbitrary constants, determined by the initial conditions. 
We assume here that h? — k? = —p? <0, ie. that the free vibrations are 
damped. The first term in (77) rapidly decreases with increasing ¢, due to the 
presence of the factor e~"t(h > 0), so that this term only has a noticeable 
influence on æ for é close to zero (transient process); afterwards, x is deter- 
mined almost exclusively by the second purely sinusoidal term, which is inde- 
pendent of the initial conditions (steady-state process), 

We now investigate expressions (75) and (76), which define the amplitude 
N and the phase difference ô between solution (74) and the right-hand side of 
equation (73). 

If the right-hand side of equation (73) consisted only of the constant Hy, 
the equation would be 


a” + Qhe’ + ke = Hy 
and would have an obvious particular solution, in the form of a constant: 


H 
ano 
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This is the statical deflection, which would be produced by a constant force. 
We introduce into the discussion the ratio 


which gives a measure of the dynamical susceptibility of the system to the 
action of external forces. We get by using (75) and the expression for é, 


ir es 
VE — ow) + Ah? ot yh a) ah? wot 
~ Re a 


0.25 05 0% 10 125 15 175 20 q 
Fig. 23 


It is clear from this last expression that 4 depends only on the two ratios 


w 2h 
Po ae (78) 


The mechanical significance of the first ratio may be explained as follows. 
If there were no resistance, the free vibrations would be given by (60): 


x = A sin (kt + p) 


and the period would be t = 22/k. Let the period of the disturbing force be 
denoted by T = 22/w. We now have for q: 


T 
=z 9 
I= 7 (79) 
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i.e. q is equal to the ratio of the period of free vibration of the system without 
resistance to the period of the disturbing force. 
We now have for A: 
1 


4 SS? 
Va- gye 
where g is as described above, whilst the constant y is obviously independent 
of the action of external forces from its definition. Since h is small, y is usually 
small, and if g is not close to unity, the value of 4 is approximately 1/(1 — g?). 
Figure 23 illustrates 4 as a function of q for various given values of y. 
On dividing numerator and denominator in expressions (76) by k?, we get 
the formulae: 


(80) 


cos 6 = (1 — q?) À; (81) 


sin ô = — yqi, 


These give the phase difference between the external force and the disturbance 
produced by it. 

Since A depends on q, it is indirectly dependent on the period T of the external 
force. Let us find the maximum of å as a function of g. All we need for this is 
to find the minimum of 


= (1 — gt)? + y? g? 


as a function of q?. Itis ae shown that the minimum occurs with q? = 1 — 
— y?/2, and is equal to (y? — y4/4). Hence it follows that maximum A occurs 


with 
go -% (82) 
and is equal to 
oe _ 1l 
max y T 
4 


For y small, the g corresponding to maximum A is near unity, i.e. the period 
of the external force which, with a given amplitude, produces the greatest 
effect is close to the period of the free vibration. The difference between these 
periods, which depends on y, is due to the presence of resistance. 

If there is no resistance, y = 0, and maximum A is infinity, occurring atq = 1. 

In this case, characterized by the conditions h = 0 and w = k, equation 
(73) becomes 

a” + kh? x= H, sin (kt + Po)» (83) 


and its solution cannot in fact be sought in the form (74). 
We suggest that the reader prove that equation (83) has the solution 


g= — 


H, 
The t cos (kt + po) 


which contains ¢ as a factor [29]. 
We return to the case when resistance is present, i.e. h Æ 0. As is clear from 
the graph, A increases rapidly before the maximum, and decreases rapidly 


33] SINUSOIDAL EXTERNAL FORCES AND RESONANCE 101 


afterwards. This is also easily seen from (80), with small y. On substituting in 
(81) Amax and the expression for q of (82), we get: 


ee S 
cos ô = 5 


whence it is clear that, for greatest effect of the external force, and for small 
y, the phase difference ô is close to (— 2/2). 

We now return to (77). Even for fairly small values of ?, the first term, giving 
the free damped vibrations, will be small compared with the second. We shall 
now vary w, ie. the period T of the disturbing force. By what has been said 
above, the following effect will now be obtained: as T approaches a certain 
value, the forced vibrations will rapidly increase, attain a maximum, and then 
rapidly fall off as T passes the value concerned. This phenomenon is called 
resonance. It is encountered in a great variety of processes of an oscillatory 
nature: in mechanical vibrations, electrical oscillations, sound, etc. 

We now suppose that the right-hand side of the equation contains the sum 
of several sinusoidal quantities: 


m 
x” + Qha’ + k? æ= X Hsin (wit + pi). (84) 
i=l 


For every term on the right, there is a corresponding proper forced vibration 
of the form 
N; sin (wt + p; + ôi) (@==1, 2, ..., m), 


where N; and ô; are given by (75) and (76), if the right-hand side of the equation 
is known. The sum of the above forced vibrations will correspond to the sum 
of the external forces, i.e. the 
particular solution of equation 
(84) is [29] 


m 
r= = N; sin (wt + gi +å). (85) 
i= 


We now show how the ampli- 
tudes and phases of the terms on 
the right-hand side of equation 
(84) can be found when they are unknown, by observing the forced vibrations. 

Suppose that we are able to vary k?, i.e. the period r of the free vibration. 
The following effect will now occur: as r approaches a certain value 1,, the 
amplitude of the forced vibrations will rapidly increase, reach a maximum, 
then fall off on further variation of z; it will now remain small until t approaches 
a new value 1,, corresponding to a second maximum of amplitude of the type 
just described, and so on. 

These maxima are due to resonance with the individual external forces appear- 
ing on the right-hand side of equation (84), and the values of 1t,, 7, ... give 
approximately the periods of the external forces. If we plot the periods of 
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the free vibrations along the axis of abscissae, and the amplitudes of the forced 
vibrations on the axis of ordinates, we obtain a curve with several maxima 
(Fig. 24). 

For t = tj (or k = kj = 2n/t;), the term for which w; is near k; will be 
large compared with the other terms in the sum (85). By observing experimentally 
the maximum values of the amplitudes of the forced vibrations, and taking 


these as approximately equal to the Nj, we can use the formulae: 
N j ~ emer pero ’ 
Vík; — wj}? + 4h? wt 
whilst bearing in mind that k; is close to wj, to find the approximate values of 
the intensities of the forces: 


Hj ~ 2hkjNj. 
34, Impulsive external forces. We consider forced vibrations without friction, 
a” +k? x= f (t) (86) 


and take a special type of external force f(t), acting only in a short interval 

of time, from ¢ = 0 to t = T, which rises from zero at the beginning of the 

interval, reaches a positive maximum, then diminishes to zero (Fig. 25). 
The genera] solution of equation (86) has the form [32]: 


t 
2=C, cos kt + O, sin kt + = | f(u) sin k (t — u) du. 
0 
Let the system be in the equilibrium position with zero initial velocity at 
t = 0: 
T lpm = T | tay = O. (87) 


We know that the particular solution 
corresponding to these initial conditions is: 


t 
z=7 fiw sin k (t — u) du, 


Fie. 25 o 


which we shall now investigate. 
When ¢> T the integral reduces to the integral over the interval (0, T), 
since by hypothesis 


f(u)=0 for u>T. 
Consequently, 
T 
c= -y fiw sin k(t — u)du for t>T. 
0 
or 


T T 
c= ~ sin kt frw cos ku du — + cos kt frw sin ku du. 
0 0 


35] STATIOAL EXTERNAL FORCES 103 


Since the function f(u) is positive by hypothesis in the interval (0, T), we 
can apply the mean value theorem [I, 95] to the integral written: 


T T 
f f(u) cos ku du = cos k0, T f f(u) du 
ò 6 
(0<6, and 6, < 1) 
T T, 
f Ku) sin ku du = sin k0, Tf f(u) du. 
0 0 


Let the duration T' of the action of the external force be small compared 
with the period of free vibration t = 22/k. 

Since kT = 22 T/r now becomes a small quantity, we can replace cos k0, T 
by unity and sin kð T by zero, so that we get: 


g = ŞI sin kt, (88) 
where 


T 
I=f f(t) de 
0 


is the magnitude of the impulse of the external force. 
It is easily verified that (88) is identical with the formula for the solution 
of the equation: 
æ” +k x=0 


with the initial conditions [32]: 
Teo = Blee 5L, 


i.e. if the action of the external force is of small duration compared with the period 
of free vibration, the vibration of the system on cessation of the external force will 
occur as a free vibration with the system driven from its equilibrium position 
with initial velocity I. 


35. Statical external forces. We now make a different assumption about the 
force f(t); we let the total interval of action of the force, (0, T), be split into 
two intervals (0, T,) and (7, T), such that the force is increasing in the first 
and decreasing in the second sub-interval; and we further suppose that the 
period of free vibration t = 2x/k is small compared with the duration of the 
increase (and decrease) of the force. 

We now solve equation (86) with initial conditions (87). We get by integration 
by parts, and on noting that f(0) = 0: 


1 u=t 1 a 
x =y fu) cos k(t — u) cS — aif (u) cos k(t — u) du = 
0 


(89) 


t 
a =a z aff (u) cos k(t — u) du. 
0 
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The first term f(t)/k? is called the statical deflection, produced by the force 
f(t). We get this expression from (86) by neglecting the term x”, i.e. neglecting 
the dynamic nature of the action of the force, 

The second term is the correction that has to be given to the statical effect 
in order to obtain the actual dynamic effect of the force. This second term 
can be put in the form: 


t 
— -ir | F (u) cos k (¢ — u) du = 
0 


t t 
= — = cos ufr (u) cos ku du — = sin ufr (u) sin ku du. (90) 
; 0 0 

Let us consider the interval of increase of the force, so that we have t < T,- 
In order to simplify the argument, we shall assume that the first derivative 
J’(é), which is positive in the interval (0, 7',), is diminishing, i.e. that the growth 
of the force becomes slower in the course of time. We show that, given this 
assumption, the two integrals on the right-hand side of equation (90) are small 
in absolute value. We shall only discuss the integral containing sin ku, since 
the other integral can be treated in a like manner. 

We subdivide the total interval of integration (0, t) into half-periods of the 
free vibration, 7/2 = z/k, and let the number of full half-periods included 
in t be m, so that: 

Tt 


ma 


<t<(m +>: 
We now have 
t 
t 2 T 
Sf (u) sin kudu = f P (u)sin kudu + È f (u) sin kudu +... + 
0 0 T 


2 


t 
ma t 
+ f f(ujsinkudu+ f f (u)sinkudu, 
(m—l) $ m ka 


and the last interval (m1/2, t) will in general be less than 1/2. 

Since sin ku does not change sign in each of the sub-intervals into which 
the total interval has been divided, we can apply the mean value theorem 
{I, 95] and, bearing in mind that kr = 22, we can write 


(+1) = (s+) 
f f (u) sin ku du = f’(us) | sin ku du = 
t 
i a 
1 u=(s+1) = 
= j’ (us) [cos ku] x = 
u=s— 


2 


poe +r (us) [cos (8 + 1) 2 — cos ax] = (— I f(y)=(- (us), 
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where 
t 


2-2 


<u < (6 +1) (8=0, 1, 2,....m—1). 


Similarly, we have for the last interval: 


t 
f ff (u) sin ku du = (— 1)" 6f (um), where O < 0 < Land m—- < tm < t 
T 


m — 
2 


Hence we have: 
t 
[F (D sinku du = EF (ta) — 1 (0) + P (tg) — 2 
0 


+ (— DTI (um-1) + (— BON (Um). 


In view of our assumption regarding f(t), the terms of the alternating series 
decrease in absolute value on moving away from the initial term; the total 
sum therefore has the (++) sign but is less than the first term [I, 123]: 


t 
o< Í f (u) sin ku du < + af’ (u). 
0 


For r small, zf’(u,) is approximately equal to the increment of f(u) in the 
interval (u, u, + 7) [I, 50], ie. zf’(u,) is roughly equal to the change in the 
force in an interval of time equal to the period of free vibration. 

If this interval is so small by comparison with the total interval of increase 
of the force that the above change in the force can be reckoned negligible, 


the integrals 
t 


t 
fr (u) sin ku du and Sr (u) cos ku du 
0 0 
will be small in absolute value, and the second term on the right-hand side of 
equation (89) will, by (90), be a small quantity compared with the first term. 
Precisely the same argument applies for the interval in which the force is 
decreasing. Hence, if the period of free vibration is small compared with the 
total duration of the action of the force, the deviation produced by the force can be 
found from the statical deviation. 
It follows from the above discussion that t must be so small by comparison 
with T that the change in the force during the interval t can be neglected. 
If the derivative f’(t) is not always decreasing during the interval of increase 
of the force, but has a single maximum, as is often the case in practice, the argu- 
ment remains the same in essence as that given above. The only difference lies 
in the fact that, in summing the alternating series, it has to be divided into two 
parts, and the prevailing term in the sum will be a middle term, corresponding 
to the particular interval in which the maximum of f’(t) occurs. 
The possibility..of determining statically the deflection due to the external 
force is of importance in devices designed for recording this force. We shall 
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take as an example the indicator of a steam engine. This amounts to a 
cylinder with a close-fitting piston. The latter is subject to the pressure of 
the steam and compresses an elastic spring. 

Let s be the area of the piston, f,(t) the pressure of the steam, k? the rigidity 
of the spring, m the mass of the piston, and x its displacement. The equation of 
motion of the piston is 

ma” = —kia+sf,(t), or a + k g= f(t), 
where 


k? 
k? = = and f(t) = AO ; 


The value of x is given by (89). The second term on the right-hand side of 
this expression represents the instrument error. For the error to be small, 
the period of free vibration of the piston on the spring must be small compared 
with the duration of the action of the force. Given this, the reading of the indi- 
cator will approximate closely to the curve of f(t), i.e. to the curve of the external 
force (to within a constant factor). If the pressure increases so rapidly, however, 
that the change in pressure is significant during an interval equal to the period 
of free vibration, the indicator readings will diverge considerably from the 
pressure curve.t 


36. The strength of a thin elastic rod, compressed by longitudinal forces 

(Euler’s problem). If a thin, straight elastic rod AB, the ends of which can 

move along the line AB (Fig. 26), is subjected to two 

| forces P, acting on its ends and compressing it along its 

Bg--- axis, distortion of the axis of the rod, leading to its collapse, 

can occur at a critical value of the force. The problem of 

_ finding the force capable of producing such distortion (the 

l problem of the so-called “longitudinal bending” of the rod) 
was first stated and solved by Euler. 

Let } be the length of the rod AB, E the modulus of elas- 
ticity of the material of the rod, and I the moment of iner- 
tia of its cross-section, which we can take as constant over 
all ita length [16]. 

Let OX be taken from the end A along the axis of the 
rod to the end B, and let y denote the ordinate of the 
elastic curve of the rod. The differential equation of the 

elastic curve becomes in this case:tf 


d? 
EI Sh = — Py (91) 
or, putting g? = P/EI: 
d?y 2uy=0 (92) 
Jar TFYS O. 


+ A more detailed account of this problem may be found in A. N. Krilov’s 
article, ‘“Nekotorie zamechaniya o kresherah i indikatorah” in Izvestiya 
Akademii Nauk, 1909. 

tt The bending moment of one of the forces P is evidently (—Py) for any 
section of the rod. 
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The general solution of this equation is: 
y = 0, cos gx + C, sin gz. (93) 


The fact that the ends A, B must remain on axis OX gives us the condi- 
tions: 
Ylxeo = Ylxut = 0. (94) 
We notice that these are not initial conditions. Initial conditions specify 
the value of the function y and of its derivative y’ for a definite value of x. 
Conditions (94) specify only y, though for two values of the independent variable, 
at the ends of the interval (0, l); they are called, in fact, boundary conditions. 
We substitute x = 0 and v = 1 in the general solution (93): 


O=C,;; 0O=O,cosqli+C,singl and 0, =0; C,singl =0. (95) 


These equations have the obvious solution C, = C, = 0, which. by (93), 
gives y = 0, i.e. the straight form of the rod. For distortion of the axis to be 
possible, we must have C, Æ 0, which means that sin gl = 0. Hence q must 
take one of the values: 


q=- (8 =0, few: (96) 
The first solution s = 0 makes g and y zero, and agair gives the straight 


elastic curve. The least non-zero value of q is obtained for s = 1: 


_ a 
oer ae 


On substituting this value in the equation q? = P/EI, we get the least value 
of the force capable of producing distortion : 


n? EI 
BRB? 


P, = Elĝ = (97) 


or the so-called critical force (Euler’s formula). 
The curve along which the rod bends for P = P, will have the equation: 


_ n 
y = C, sin —— 2. 


l 


i.e. consists of a half sine wave (Fig. 26). The state of equilibrium is unstable, 
and considerable deformations are possible. 
We find, on setting « = 2 in (96): 


27 
e ia 


The equation of the axis of bending of the rod now becomes: 


y= 0, sin 2% r, 


l 


and the bending curve consists here of two half waves. 
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The force P, needed to produce this deformation is: 


2 
P, = Elg = Ae aE : 
and is thus four times greater than the previous force. 

On giving successive integral values to s, we get all the possible equilibrium 
forms of the bending axis of the rod. These will consist of a corresponding 
number of half sine waves, whilst the forces required for the appearance of 
these distortions will be proportional to the square of the number of half waves. 

It may be pointed out that differential equation (91) is approximate, in the 
sense that the curvature of the bending axis of the rod is taken equal to the 
second derivative; the equation therefore only applies in regard to small deforma- 
tions of the rod. The conclusions drawn from general solution (93) of the equation 
are not justified as regards forces P which produce considerable bendings of 
the rod, and can clearly lead to absurd results. 

Numerous experiments with long, thin rods have shown that the rod at 
first preserves its straight shape with gradually increasing P, then suffers con- 
siderable distortion of its axis on P reaching a value near P,, as defined by 
(97); the bending thereafter increases with great rapidity as P continues to 
increase. 

The role of the boundary conditions (94) must be mentioned. Given the initial 
conditions, the solution of a linear equation is uniquely defined. A different 
situation arises with boundary conditions, as we have seen. Particular values 
(96) of the coefficient g in equation (92) exist, such that, given boundary condi- 
tions (94), the equation has, apart from the obvious solution y = 0, solutions 
which are defined up to an arbitrary constant factor. We shall meet with the 
same situation in the example below [37]. 


37. Rotating shaft. Experiment shows that the following effect occurs on 
rapid rotation of a long, thin shaft: as the angular velocity increases, it reaches 
a certain value w = w, at which the shaft no longer remains straight but begins 
to wobble; as w increases further, stability is again achieved for a time, then is 
lost again at @ == w, and so on. We explain the reason for this effect and the 
method of calculating the critical velocities: w,, w,, ... 

Generally speaking, a rotating shaft has a straight shape at equilibrium, 
but at the above-mentioned critical velocities the shaft can have a bent dyna- 
mical equilibrium shape in addition to the straight equilibrium shape; with 
this, any chance factor can lead to distortion of the shaft, which causes it to 
wobble. 

Let the shaft be supported at its ends x = 0 and v = l, and let y denote 
the amount of bending, as usual. Each element dx of the bent rotating shaft 
is subjected to a centrifugal force (p/g) w*y dx, where p is the weight per unit 
length of the shaft and g is the acceleration due to gravity. On taking this 
force as a continuously distributed load, we obtain from equations (25) and 
(32) [16]: 


diy _ pœ? 
EL-i = gT 
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or, writing 


i eee 
2 
a= | Sr 


yGv) — gty=0. (99) 


we have: 


The corresponding characteristic equation rt — qt == 0 has roots: +q, qi, 
and the general solution of (99) becomes: 


y= 0,0e% + C,e—-& + C, cos gx + C, sin gz. 
The deflection and bending moment must both be zero at the supported 
ends; we thus have the four boundary conditions: 


_, dy _ dy 
¥\ dz? SY ha dat 


= 0, 


x=0 Xæl 


These can easily be seen to reduce to the system of equations: 


C,e% +0, 674l — C, cos gl — O, sin ql = 0; 
O, e% + 0, 674l + C, cos gl + 0, sin gl = 0; (100) 
C,+0,+C,=0; C,+0,—0,=0. 
The solution 
C, =0,=0,=0,=0 (101) 


corresponds to the obvious identity y = 0, i.e. to the straight shape of the shaft. 
We now find the values of q for which system (100) has solutions differing 
from (101). 

The first two equations give: 


C,= - Cy C,=0. 
We get by substitution in the last two equations: 
Cı =C,=C0;,=0; QC, sin gi = 0. 
If C, Æ 0, we must have sin gl = 0, which gives the values for q: 


q= —— (8=1,2,...). (102) 


On using (98), we get the following expression for the critical velocities: 


8? n? AI 
a= EY a (8 =1, 2, 3,...). 


38. Symbolic method. We now come to a fresh method of integrat- 
ing a single linear equation or a set of linear equations with constant 
coefficients. The method can be applied, with suitable generalization, 
to more complex problems. It consists essentially in a symbolic 
notation for the operation of differentiation with respect to the 
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independent variable t, by writing a D to the left of the function to 
be differentiated. Thus, if x is a function of t, 


da 
Da = rari 


and in general, for any positive integral s: 


da 


Maeg 


(103) 


If a is a constant, obviously 
DS (ax) = aD zx, (104) 


i.e. the law of transposition holds for the product of a symbolic 
factor and a constant. If F(D) is a polynomial in D with constant 
coefficients: 
F(D) = a D” + a, D! 4... + ani D Hap 
the operation F(D) z is defined as: 
F(D)x = a D" x +a, Dx 4.. ue Dz + apt = 
dř z d-t g 
= lo -gma att + as, der! is $y 1G +a, T. 


If g(D) is the product of two polynomials p,(D) and pD), we 
have, using (104) and the obvious equality D™ (D™ x) = D™*™ x: 


pı (D) [p2 (D) £] = (D) x, 


where the factors ¢,(D) and ¢,(D) can be transposed. 
Evidently, in the same way, 


[pi (D) + pa (D)] x = 9 (D) £ + pa (D) x, 


and the result obtained is independent of the order of the terms 
pı(D) and @,(D). 

The ordinary rules of addition, subtraction and multiplication thus 
extend to the symbolic polynomials now introduced. 

By (104), a constant factor can be taken outside the sign of a 
symbolic polynomial, so that, along with (104), we have 


F(D) (ax) = aF (D) x; 
of course, this is not permissible with factors dependent on t. We now 
prove the formula: 
F(D) (e™ x) = e™ F(D + m) x, (105) 
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where m is a constant. The expression indicates that a factor of the 
form e™ can be taken outside the sign of a symbolic polynomial after 
substituting (D + m) for D in the latter. 

The expression F(D) (e™! x) consists of terms of the type 
an-s D' (e™ x), and it is sufficient to prove (105) for all such terms, 
i.e. we need only prove 


Ds (e™ x) = e™ (D+ myz. (106) 
If we use Leibniz’s formula for differentiating a product, we can 
write [I, 53]: 


Z as (em! x) 
= dts 


+ C2 (em )S—-) g" 4... OF (eh C—O of d... eM a), 


Ds (em 2) as (em) (s) x + Cc} (emt)(s—D a’ + 


where the superscript in brackets indicates the order of the derivative 
with respect to t, and C% is the number of combinations of k from s 
elements. Since (e™)® — m?e™ and zP = D” x, we can write, on 
taking e™ outside the brackets: 


DS (e™ x) = e™ (m x + Clms—) Da + Olm? D? a +... + 
+ Okms-* D¥at...+ DS x)= e" (ms 4- Otm D+ 
+ O2ms-2 D? +... 4+ Ok m= DE... + D’) x. 


But the right-hand side is identical with the right-hand side of (106); 
(106) is thus proved, which amounts to proving (105). 

We now define negative powers of D as operations the inverse of 
differentiation, i.e. we define D~* f(t) as the solution of the equation 


Dsx = f(t), (107) 


where, in order to give a precise meaning to the symbol D~° f(t), 
we agree to take the solution which satisfies the zero initial con- 
ditions: 

Ulpnty = L heta Se EHD | = O. (108) 


In other words, we shall take [15] 


t 
D> f(t) = enti f (t — u)! f(u) du. (109) 
to 
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The general solution of equation (107) now becomes [15]: 


t 
£ = D= f(E) + Poi (8) = qy | E — ww) du + Po (t), 
i (110) 


where P,_, (t) is a polynomial in t of degree (s — 1) with arbitrary 
coefficients. 
We define the more general operation (D — a)~* f(t) as the solution 


of the equation 
(D — a} x = f(t), (111) 


satisfying conditions (108). To find this solution, we bring in a new 
unknown z instead of x, where: 


g= e“ z. (112) 


On substituting in (111) and using the rule expressed by (105), we 
obtain the equation for z: i 


e” (D +a — a) z= f(t) or Dz =e“ f(t). (113) 
The solution of this equation, which satisfies the conditions: 
limt = 2 lieto = + = 2E [panty =0, (114) 


can be determined in accordance with (109), provided we write 
e™% f(t) here in place of f(t): 


t 
z= a ie = ii fe — u)ste— f(u) du. 
to 


But it follows from the formulae: 
Dix = Dietz = e* (D + a} z (j = 0, 1, 2,...,8—1) 


that, if z satisfies conditions (114), z, as defined by (112), satisfies 
conditions (108). On substituting the expression found for z in (112), 
we get the required solution of (111): 


eat 
(e@— TD)! 


t 
fe — u)5-1 e7% f(u) du. (115) 


(D — a= f(t) = 
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The general solution of this equation is obtained if we multiply 
the general solution of equation (113) by e“, i.e. the general solution 
will be: 

a == (D — a) fit) + 0% P, (t) = 
t (116) 
= ei f(e — uso fu) du + e% Po (t), 
te 


where P,- (t) is a polynomial in ¢ of degree (s — 1) with arbitrary 
coefficients. 
In particular, setting /(t) = 0, we get the general solution of the 
equation 
(D —a)'x=0 (117) 
as 
æ = e“ P, (t). (118) 


39. Linear homogeneous equations of higher orders with constant 
coefficients. A linear homogeneous equation of the nth order with 
constant coefficients is of the form: 


L L a D 4+ FH apt Hanem O. (119) 


If we denote differentiation with respect to t by the symbolic 
operator D and introduce the polynomial: 


(D) = D" +a, Dr +... + an1 D + am 


we can write the equation as: 


¢(D) x=0. (120) 

The characteristic equation corresponding to equation (119) is: 
Hari... t anr t a [= O, (121) 
with, say, roots 7,,7,,...,7m Of multiplicities ki, ko ..-, km, where 
kit ke +... tH kp=n. (122) 


On factorizing the polynomial g(D), we can write equation 
(120) in the form: 


(D — r)“ (D — 1g)" ... (D — rpne = 0. (123) 
The equation 
(D — Tm)” £ = 0, (124) 
has, by (118) [38], the general solution 


x = e'm P, _. (t), (125) 
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where P (t) is a polynomial of degree (km — 1) with arbitrary 
coefficients. 

Function (125) will clearly also be a solution of equation (123). 
We see this by substituting (125) in (128), when we get zero as a 
result of the operation (D — rm)"; the operations 


(D — r) (D — ra)... (D — Tm)", 


multiplied by zero, evidently also give zero. We could now transpose 
the factors so that some other factor, say (D —7,)“, stood next 
to x. It may be seen, by means of this device, that a series of particular 
solutions exists: 


z, = et Pult) (8 =1,2,...,m), (126) 


where P(t) is a polynomial of degree (ks— 1) with arbitrary 
coefficients. 

By assigning to s all the values from 1 to m in (126), and adding 
all the solutions thus obtained, we arrive at the solution of equation 
(123) [26]: 


g = e" P, (t) + es Pai (t) +... + emt Pini (É) (127) 


Each polynomial Px _,(t) of degree (ks — 1) witharbitrary coefficients 
contains altogether k; arbitrary constants, and therefore, by rela- 
tionship (122), solution (127) contains in all n arbitrary constants. 
In view of this, it may be surmised that (127) represents the general 
solution of equation (119), i.e. that every solution of this equation 
is included in (127). 

This was proved above by expression (118) of [38] for the case of 
m == 1, so that it remains to show that, if our assertion is justified 
for the case of (m — 1) factors of the form (D — r,)", it is also justified 
for m factors. The proof is as follows. Equation (123) can be rewritten 
as: 

(D — 1,)"(D — ra)... (D — tm) y = 0, 
where 
y = (D — Tm)" z. 


We suppose that our statement is proved for (m — 1) factors, 
so that we have the general solution for y: 


y=(D— Tm)” z= Qr (t) + ent Qra—ı t)+...+ 
+ emt DV im—r(t); 
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where the Q,,-,(t) are arbitrary polynomials of degree (k, — 1). 
On writing 
x= elm z, (128) 


taking e™ outside the sign of the symbolic polynomial, and dividing 
both sides of the equation by e™', we get: 


Dim z= elim)! Qui (t) H ete Tmt Qr- (t) T oog F 
F oC mmt QVkma—1 (t) ` 


We get the general expression for z on integrating the right-hand 
side km times with respect to ¢ and adding a polynomial of degree 
(km — 1) [15]. Now we know [I, 201] that the integral of the product 
of the exponential function e“ and a polynomial of degree k in ¢ is 
of the same form. Thus, z must be of the form: 


J efnt Piya (£) + efm Pri (t) Feot 
+ efm m)t Primal (t) + P igi (t) j 


On using (128), we see that x must be as given by (127), which is 
what we required to prove. 

In particular, if the roots of the characteristic equation are all 
simple, all the P(t) are polynomials of zero degree (k, = 1), in 
other words, they are arbitrary constants Cs; here, the general solution 
of the equation has the form: 


x = C et + Coe™+...4+ C, 0%. 


Assuming that the coefficients of equation (121) are real, some of 
its roots may nevertheless be complex. The terms in solution (127) 
corresponding to these complex roots are easily reducible to real form 
by passing from exponential to trigonometric functions. Suppose that 
(121) has a pair of conjugate complex roots (y +- ôi) of multiplicity k. 
The solution corresponding to these will be of the form 


0HE S, (6) + e0500 P, (£) = oF fol! Sr (1) + 0 Ty, (8), 


where S,_,(é) and Tkt) are polynomials of degree (k — 1) with 
arbitrary coefficients. If we write 


e*! — cos ôt + isinôt;  e~* — cos ôt — i sin ôt, 
we obtain the solution in the form 


et [U p (t) cos ôt +- V,_, (t) sin ôt], 
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where U;_,(t) and V(t) are polynomials of degree (k — 1) with 
arbitrary coefficients, related to S;,_,(t) and T,.,(t) via the expressions 


U p (t) = Sp (E) + Tr 6) Vim (t) = ESk (4) — Ty H 


The above discussion leads us to the following rule [27]: equation 
(119) must be integrated by first forming the corresponding characteristic 
equation (121) and finding its roots. For every real solution r = r’ of 
multiplicity k’, there is a corresponding solution of the form: 


et Pya (t), 


where Py (t) is a polynomial of degree (k’ — 1) with arbitrary coef- 
ficients; and for every pair of conjugate complex roots r = (y + ôi) of 
multiplicity k, there is a corresponding solution of the form 


et [U „— (t) cos ôt + V p~ (£) sin ôt], 


where Ux,(t) and Vrlt) are polynomials of degree (k— 1) with 
arbitrary coefficients. Addition of all the solutions thus obtained gives 
us the general solution of equation (119). The polynomials reduce to 
arbitrary constants in the case of simple roots. 


40. Linear non-homogeneous equations with constant coefficients. 
A linear non-homogeneous equation is of the form 


p(D)a = f(t), (129) 


where f(t) is a given function. Its general solution is obtained [25] by 
adding to the general solution of the corresponding homogeneous 
equation, which we know already how to obtain, a particular solution 
of it, which we must now proceed to find. We use the symbolic method 
and split the rational fraction 1/g(D) into partial fractions [I, 196]: 
I va m Ka AWM 
pD) qi Dry 


Let the function (t) be defined by 


m k AM 
E(t)= S 2 wl, (130) 


s=] q=1 
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the meaning of this expression being fully defined since, in accordance 
with (115) of [38], each member of the right-hand side has a definite 


meaning: 
t 


(4) r 
pee O= AP pS tweet fw du. (131) 


(Ci SE 
to 


It is easily shown that (130) in fact gives a solution of (129). We 
consider 


By definition of the symbol (D — rs)", if we operate on the right- 
hand side of (131) with (D — r,)f, the result is AMf(t). Since the 
polynomial ¢(D) is divisible by (D — r,)', we have g(D) = Ps(D) 
(D — rs)", where ps(D) is a polynomial. Hence we can re-write the 
previous formula as: 


But it immediately follows from the expansion of I/g(D) that 


5 5i A® p (D) = 1, and therefore p (D) E(t) = f (t), 
s=1q=1 


so that a solution of equation (129) is actually given by (130). 
We see from this that finding a solution of (129) for any given 
function f(t) reduces to splitting a rational fraction into partial 
fractions and then integrating. 

The particular solution of equation (129) is found more simply in 
some particular cases by the method of undetermined coefficients, 
as demonstrated in [29], than by the general formula (130). 

We remark that formulae (71) and (72) of [32] are easily derived 
by using the above symbolic method. 


41. Example. We take as an example the equation 
ai’) 4.99" + x= t cost. (132) 
Here, the characteristic equation becomes 
ri + 2r? -+ 1=0 or (r?+1)? =0. (133) 


This has a pair of conjugate, double roots r = +t. The general solution of the 
homogeneous equation corresponding to (132) is 


(Cit +C.) cos t+ (C +C,) sin t. (134) 
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We see by comparing the right-hand siJe of the given equation with (43) of 
[29] that here, k = 0, l= 1, and P(x) = 1, Q(x) = 0. The numbers k + li = +i 
coincide with the pair of double roots, so that the solution of (132) must, by 
[29], be sought in the form 


x= l? [(at +b) cos t+ (ct + d)sin t]. (135) 


The problem will be simplified by writing the right-hand side of (132) in the 
exponential form. If we do this, and at the same time write the left-hand side 
in the symbolic form, (132) becomes: 


(D? + 1)? z = i e 5 et, (136) 


We shall have to look for the solution in the form 
x= 1? (at + b) e” + t (ct +d) e". (137) 
We substitute this expression in the left-hand side of the equation: 


t 


jl t . 

it ait 
ze + ae . 
2 


(D + i)? (D — i)? t (at + b) ef + (D + i)? (D — i)? t? (ct + d) o” = 5 


On taking e! and e~* outside the symbolic polynomials in accordance with 
rule (111), we get: 


ol (D + 2i)? D? (att + bit) +o (D — 2Y Dt (0 + de) = + olf 4 5 ol, 
or, on writing the second derivative instead of D?: 
e (D? + 4iD — 4) (Gat + 2b) + en {D2 — 41D — 4) (6ct + 2d) = 5 et- > ef, 
We carry out the differentiations: 
[ — 24at + (24ai — 8b)] ef + [— 24ct — (24ci + 8d)] e7 = — elf + — 0 


Hence, by the method of undetermined coefficients: 


-24a =- ; Mai 8b=0; — Me = i; 2de + 8d =0, 
or 
1 l., ie aes ee 
aa OS Tg RS ag ag 


On substituting in (137), we obtain the solution: 


t3 es, 
=—— — -> t, 138 
x 31 cost g sin (138) 
and the general solution of equation (132) becomes: 


2 


3 t ‘ 
æ= (O,t-+C,) cost + (Cst +C,) sint — pg cost — -p sin t. (139) 
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42. Euler’s equation. This equation has the form 
Mae) + a, Mle") 4 ...4a,_,tz + ane = O, (140) 
where d,,d,,...,@, are constants. We show that it reduces to an 
equation with constant coefficients on replacing the independent 
variable ¢ by t, defined by the expression 
i= of, (141) 


We denote differentiation with respect to t by the symbolic factor D 
as previously, whilst symbolic 6 will denote differentiation with 
respect to t. We obviously have 


da da dt ~ dx 


or, in symbolic notation: 
Du = e ôg. (142) 

If we operate on the left-hand side with D, and on the right-hand 

side with the equivalent e~* ô, we get: 
D? x = e—*8(e—7 ô) x. 

On taking the factor e~* outside the 6 sign, in accordance with the 

rule of (111), we find: 
D? x = e~% (ô — 1) da = e7% ô (ô — 1) a. 
This expression, together with (142), suggests the following general 


formula: 
Ds x = e7 (ð — 1)...(8— s+ 1)z. (143) 


We have to show that, if the formula is true for s symbolic factors, 
it is true for (s + 1) factors. Let (143) be assumed true, and let us 
operate on the left-hand side with D, and on the right-hand side 
with the equivalent e~* ô; we get: 

DS+! x = e~* ô [e—* ô (ô —1)...(6—s +1) a], 
whence, taking e~* 
Dstt y = ete (6 — s) ô (8 — 1)... (ô — s + 1) a= 


= e~S* §(§ — 1)...(6 — s + 1) (ô — $) z, 


outside the ô sign: 


which shows that (143) is true for any s. 
On writing ¢ for e", the formula becomes: 


t Dx = ô (ô — 1)... (8 — s + 1)z. (144) 
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It follows that, as a result of transformation (141), every term 
ans x on the left-hand side of (140) is replaced by a term 
ans (ô —1)...(8—8 + 1)z, 


which does not contain the independent variable t; hence we obtain 
the linear equation with constant coefficients: 


[ô (ô — 1)... (ô — n + 1) + aô (ô — 1)... (8 — n -+ 2)+...+ 
Fan Han] £ = 0. Lae) 
The characteristic equation corresponding to this will be: 
r(ir—l)...(r—n+1)+a,r(r—1)...(rp—-n+2)4+...4 
+ a@,-17 +a, = 9, 
so that the general solution of (145) is: 
æ = et Puma (T) +e Py (T) +... e Prai (T), 


where the r, are roots of (146) of multiplicity ks and the Pp, — (T) 
are polynomials of degree (k; — 1) with arbitrary coefficients. 

On returning to the original variable via (141), we get the solution 
of equation (140) as: 


g = t" Ppi (logt) +t Pp (logt) +...+ t Peai (logt). (147) 


If the roots of equation (146) are all simple, the solution of 
(140) becomes: 


(146) 


a= 0t + Otep... H Ont n. (148) 


Equation (146) is obtained, as is easily seen, if the solution of 
(140) is sought in the form x = t. 
Given a non-homogeneous equation of the form 


Me™ y a tolgda te + anw = tt Pilogt), (149) 
where P (logt) is a polynomial of degree p in log t, it may easily be 
seen, by using transformation (141), that the solution can be sought 


in the form: 
x = (log t) t* Q(log t), (150) 


where Q (log t) is a polynomial of degree p in log t and s is the number 
of roots of equation (146) equal to a. 
We can take, instead of (140), the more general equation of the 
type: 
(ct + d) x + a, (ct + dtd H.. H 


+a, (ct + d) x’ +a," = 0. (151) 
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The variables must here be transformed by the formula: 
citd= et’, 
instead of by (141), whilst instead of (144) we now have the expression: 
(ct + d) D5x = c ô (6 —1)...(6 —8 + l)z, 


with the aid of which equation (151) also reduces to an equation with 
constant coefficients. 


43. Systems of linear equations with constant coefficients. The 
position of a mechanical system is in many cases defined, not by one, 
but by several independent quantities q4, q2, ..-, x, which are referred 
to as parametric coordinates. The number k of these gives the number 
of degrees of freedom. In the case of rotation of a rigid body about a 
fixed axis, for instance, we have one degree of freedom, this being 
the angle 6 by which the body has turned about the axis. The rotation 
of a body about a fixed point provides three degrees of freedom, 
and the parametric coordinates can be taken, for instance, as the 
Eulerian angles, y, y and 6, which are familiar from the dynamics 
of rigid bodies. The motion of a point-mass on a plane, a sphere, 
or any other surface, has two degrees of freedom. The parametric 
coordinates may be taken as the ordinary rectangular coordinates 
x and y in the case of a plane, or as the longitude ¢ and latitude p 
in the case of a sphere. 

With the motion of a mechanical system, its parametric coordinates 
q Yor ++ +> qk are functions of time t, and are defined by a system 
of differential equations with initial conditions. In particular, when 
considering the small oscillations of a system about a position of 
equilibrium, for which the corresponding parametric values are 


Ye | eee = g=, 
usually only first order terms in qs and dq,/di are retained in the 
differential equations, so that we have a linear system with constant 
coefficients. Every equation will in general contain all the qs and 
their first and second order derivatives with respect to t. 
Given two degrees of freedom, the system will have the form: 


a, 9y + bigi + cigi + a293 + baga + Cago = O; | 
digi teat figi + daqi + e293 + faga =0, J 


where qi, qi, 93, 92 are derivatives of g, and q, with respect to t. 


(152) 
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On using the above notation of symbolic D for differentiation with 
respect to ¢, we can write (152) in a different form: 


(a, D? + b, D + c1) q1 + (gD? + by D + Cg) 92 = 0; 
(d, D? +e, D + fy) + (dy D? + ea D + fa) ga = 0. 
If external forces act on the system, there will be known functions 


of ¢ instead of zero on the right-hand sides of the equations. 
The initial conditions have the form: 


(153) 


Qi li=0 = 910 Wi li=0 = gio; 92 limo = 920; V2 tao = 120» 


where qio Zio» 92o Yao are given numbers, and the general solution of 
system (153) must contain four arbitrary constants. 

We show how integration of system (153) reduces to integration 
of a single linear equation of the fourth order with one unknown 
[20]. We do this by bringing in an auxiliary function V of t, 
putting: 


qı = — (aa D? + ba D + ca) V; qo == (a, D? +b, D +0) V. (154) 


On substituting these expressions for g, and q, in equations (153), 
we see that the first equation will be satisfied for any V, so that it 
remains to choose V such that the second equation will also be 
satisfied. 

Substitution of expressions (154) in this second equation gives us 
a fourth order equation for V:f 


[(a, D? + b, D + ¢,) (da D? + e D + fo) — 


155 
— (aD? +o D+ e(d DHe Dayo O 


Having found V, we get q, and q, from (154) by straightforward 


differentiation. 
Let 7,, Ta» 73,7, be the non-repeated roots of the characteristic 


equation: 
(a, T? + bir + c1) (da1? + ear + fr) — 
— (aT? + bar + c) (dir? 4+ eyr + fi) = 9. (Eee) 
so that 
V = O,e™ + Cie! + Ozet + Cye™. (157) 


t We assume that a, dg — a,d, Æ 0; this is always the case when consider- 
ing the motion of a material system. 
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We substitute this expression in (154), recall that De” = re" and 
D? e” = re", and get a general expression for q) and q,. It will 
consist of a linear combination of four solutions, each of which con- 
tains an arbitrary constant factor. The solution V = C,e™ gives, 
for instance: 


qı = — Cy (Agri + baT, + cghe™; qa = O, (ari + bir, + ¢,)e™. (158) 
If equation (156) has complex roots, as is usual in applications 


the solution of equation (155) can usefully be written in trigonometric 
form, so that the solutions for V: 


C,e“cosbt and C,e% sin bt, 


correspond to a pair of conjugate roots r = a + bi. 
Similarly, if (156) has a double root 7, = 7, the solutions become 


0e- and Otot. 


We now notice the case when the above method does not lead to 
the general solution for q, and g,, containing four arbitrary constants. 
Let equation (156) become, for a certain root 7;: 


ari + biT, + Ci = agri + bary + C2 = 0. (159) 


Expressions (158) for q) and q, now vanish identically, and the 
general solution of the system will not contain the arbitrary con- 
stant C,. We can try to restore the lost constant by using the equations 


qı = (da D? +e D + fa) V; qa = — (d D? +6, D + fi) V. (160) 


instead of (154), when we introduce the auxiliary function V. 

With this, the second of equations (153) will be satisfied for any V, 
whilst substitution of expressions (160) in the first equation (153) 
will give us the same equation (155) as above for V. The root r, of 
the characteristic equation (156) now gives, instead of (158), the 
expressions for g, and qs: 


dı = Ci (dari + eari + fo)e; qa = — O (diri + eiT, t fi) e™. 


Provided one at least of the factors (d, rj + e, 7, + fı) and (dr? + 
+ €,7, + f2) does not vanish, the solution corresponding to the root 
r = r; of (156) has now been restored. 

The case still remains to be considered when, in addition to rela- 
tionships (159), we have 


diri + er + fi = dari + eri + f= 0. (161) 
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With this, the method given above does not lead to restoration of 
the solution corresponding to root r= r; of equation (156). But 
since (159) and (161) are valid, all the quadratic expressions in 
brackets on the left-hand side of (156) have the root r= r, ie. 
are divisible by (r — 7,). It follows that r = r, must be a repeated 
root of (156). We confine ourselves to the case when r =r is a 
double root, and indicate the corresponding solutions of the system. 
These two solutions will be: 


q =C; qg =0 (162) 
qı = 0; qz = C,e™. (163) 
If, in fact, we substitute say expressions (162) in the left-hand 
side of either of equations (153), we obtain an identity, by (159) 
and (161). 
These solutions are distinct, since q, is identically zero in the first, 
whereas it differs from zero in the second. 


We remark that if, in the case of arepeated root 7, = 72, say, one 
of relationships (159) is not fulfilled, we obtain, on substituting 


V =C,e™andV = C, te™ 
in expressions (154), the solution (158) and a solution which contains t 
as a factor: 
qı = — Cy (aa Ti + baT, + 02) te™ + Cp, e™; 
qe = Cz (a, T} + biT, + c) te™ + O, p,e™, 


where p, and p, are definite constants. 
The general solution of the non-homogeneous system: 


(a, D? + b, D + c1) q + (a D? +b, D + ca) qo = fi (6); 
(d, D? +e, D + fi) ti + (da D? + & D + fe) de = fo (t), 
consists, as in the case of a single equation, in the sum of the general 
solution of the corresponding homogeneous system (153) and any 


particular solution of the non-homogeneous system. If the free terms 
fit) and f(t) are of the form 


A, e* cos Bt + B, e% sin ft = De“ sin (Bt + o), 


| (164) 


the particular solutions can be sought in the form 


qı = A, e% cos ft + B,e sin ft; q = Ae“ cos Bt + Be“ sin ft, 
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provided only that (a + £ i) is not a root of equation (156). On sub- 
stituting this expression in the left-hand side of equation (164) and 
equating coefficients of e” cos Bt and e™ sin Bt, we get equations 
for determining A,, B,, A,, B,. 

The particular solutions of system (164) can be obtained for any 
/,(t) and f,(t) in the same way as in the case of a single equation [40]. 
On solving system (164) for q, and q, we get, for example, for q,: 


d, D? D+h, D? + b, D 
ise ree AD} tf hd- 7 a) ta fa (t), 


where, for brevity, A(D) denotes the symbolic polynomial on the 
left-hand side of equation (155). On expanding the rational fractions 
and using the value given in [38] for the symbolic factor (D — r)~* 
we obtain the required solution of system (164). 

We further remark that, by using the arguments of [20], we can 
easily reduce the integration of a system of linear equations with 
constant coefficients to the integration of a single linear equation 
with constant coefficients. We give a general method in Volume III 
for integrating a system of equations with constant coefficients. 


44, Examples. 1. We consider the system: 
d? y d? z 
-iz Tt Gar 7Y t ee, 
where y and z are required functions of x. On finding z from the first equation: 


d? y 
a dat 
and substituting in the second, we get the fourth order equation for y: 


diy 
“dat y 


= 2r, (165) 


the general solution of which is found by the usual rule as 
y =0, 0” +C, e7% + 0, cosa +C,sin 2 — 22. 
We substitute this expression in (165) and get the expression for z: 
z = 0, e% + C,e* — 0, cos x — O, sin v — 2. 
2. We take the system of three first order equations: 


da d dz 
a ees esate; attr. (166) 
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where x, y and z are required as functions of ¢. We solve the first equation with 
respect to y: 


az (167) 


and substitute the expression obtained in the remaining two equations, which 
gives: 


d?r dz dz da 


On substituting the expression for dz/dt given by the second equation into 
the first, we get a second order equation containing only the one unknown 
æ (exceptional case): 


d2x dx 
ae gg 
the general solution of which is: 
x = 0,0” +- 007. (169,) 


We substitute this in the second of equations (168) and get a first order equa- 
tion for z: 


dz ngi ot 
ar + 2 = 30e , 


the general solution of which is: 


z = 00™ + 0,0”. (169,) 
On substituting expressions (169,) and (169,) in (167), we obtain for y: 
y =C,0e% — (0, + 03) 04. < (169,) 


We are presented here with the exceptional case referred to in (20). Instead 
of obtaining a single differential equation of the third order, we have obtained 
an equation of the second order and a further equation of the first order. 

3. We encounter systems of linear equations with constant coefficients in 
the study of electrical oscillations, as well as when considering small oscilla- 
tions of mechanical systems about the equilibrium position. Let two circuits 
be linked magnetically, i.e. the current in one circuit produces a magnetic 
field which induces an electromotive force in the second circuit. If 4,, i are 
the currents in the two circuits, the induced e. m. f. in the first circuit will be 
M di,/dt, and in the second, M di,/dt, where M is the constant mutual induc- 
tance. If we assume that there is no current source in either circuit, we 
have the equations: 


dz di l. di 

Lae tha to atM aS CEO 
dèi. d?i di Tt 

Mge + La ag + RG +g, =O Oey 


where L,, R,, C, are the self-inductance, resistance, and capacity in the first 
circuit, and L}, R, C, are the corresponding values for the second circuit. 
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We use this example to show how one of the unknown functions can be 
eliminated and a single fourth order equation obtained with one unknown, 
without introducing an auxiliary function V. 

We substitute for d? i/dt? from (171) into (170), and get the equation 
di, M 


— R, M <2 ws —F-i,=0. (172) 


di 
(LiL — M?) -p de? +LR dé “= on on 


On differentiating this equation, and substituting the expression for Md?i,/dt?: 
d?i, di di 1 


Me ge Pa gt (173) 
got from (170), we obtain: 
dèi d2¢ L di 
— M2 i p | 2 ST 
(Ll, — M) GE + LR + LR) G+ (GE + RR) Gt + 
M ‘di. 
+ a 345 7, J=. (174) 


After differentiating once more, and again substituting for Md’i,/dt? from 
(173), we finally arrive at a linear equation of the fourth order for 7,: 


d4 i di, dèi, 


(L,L, — M*) 5 = + (J+ Gt + RR) So + 


+ (L,B, +L, B,) St 


+(+ 3) tain? aw) 


If we had eliminated 7, instead of 7,, we should have got precisely the same 
fourth order equation for 7,. Its corresponding characteristic equation is 


(1 — k?) rt + 2 (gy + ge) r3 + (ni + n3 + 49,90) r? + 
+ 2 (gm + gani) r + ning = 0, (176) 
where we have written for brevity: 


Pees ee pts, 2. R 
ZYL L’ 1~ 72,0,’ a t,0, "TZL? $= oR, 


Equation (176) can also be written in the form: 
(r? + 2gyr + nî) (r2 + 2gor + nz) — krs = 0. (177) 
If there were no magnetic coupling, we should have to put M = 0 in equations 
(170) and (171), and we should obtain two separate equations, defining the dis- 
charge processes in the circuits: 
a4, 


di. 
a +n?i,;=0 and = 2 2g ga = ar + ni,=0. (178) 


+24 Gh 


Both circuits are usually oscillatory, in other words, the characteristic 
equations corresponding to differential equations (178): 


rt 2gr +n} =0 and r+ 29.7 +n? =0, (179) 
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have complex roots, ie. g,2 — n,? < 0 and g, — n, < 0, or 


R, 2 I R, 1 
2, CVLO, ®t ah, S VLO 


or alternatively 


a < E and = < je 

With k = 0, equation (177) gives two pairs of conjugate complex roots 
[the roots of equations (179)]}. With the small values of M, such as are usually 
found in practice, (177) again has two pairs of conjugate complex roots, with 
negative real parts: 7; ,» = —a + bi and r}, , = —c + di; so that the general 
expression for 7, becomes: 


i, =C,e~™ cos bt + 0,07“ sin bt + 0,07“ cos dt + C,e “sin dt. 


We remark that, on knowing ?,, we can obtain 7, also without further integra- 
tion. All we do is find di,/dé from equation (174), substitute the expression 
obtained in equation (172), and thus get a linear equation in 7,. The expression 
for 7, will contain terms of the same form as in ĉi, with coefficients that are 
linear combinations of the constants C,, Ca, C3 and C, 

If we neglect the resistances, i.e. take g, = g, = 0, and in addition, assume 
that the circuits are tuned to the same frequency, i.e. n, = n, = n, equation 
(177) becomes 

(1 — k?) rt + 2n?r? + nt = 0, 
whence 
—n? + kn? n? 
Isk è VER’ 


r? = 


and 
n n 
tote = tye ToT = ty? (@=V¥—1). 


The solution corresponding to these purely imaginary roots is of trigonometric 
form. It follows that, given magnetic coupling between two circuits tuned to 
the same frequency, two oscillations arise, whose frequencies depend on the 
common frequency n of the circuits and on the constant k which characterises 
the magnetic coupling, in accordance with the relationships: 


n n 


d 


w = Tre n” = JIk 


= 


§ 4. Integration with the aid of power series 


45. Integration of a linear equation, using a power series. The 
solutions of a linear equation of higher order than the first with variable 
coefficients are not generally expressible in terms of elementary 
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functions, as we mentioned earlier, and the integration of such an 
equation does not in general reduce to a quadrature. The most useful 
method is to represent the required solution as a power series, as 
already mentioned in [13]. This device is particularly applicable to 
linear differential equations. We shall confine ourselves to the second 
order equation: 


y” + pl(x)y +q(x)y=9. (1) 


Let the coefficients p(x) and q(x) be given in the form of power series, 
expanded in positive integral powers of x, so that the equation has 
the form: 


Y” + (Ay + ax + aga? +...) y+ (by + Bye + dye? +...) y= 0. (2) 


We notice that the coefficient of y” is taken equal to unity. 
We seek the solution of (2) as a further power series: 


y= axs. (3) 


On substituting for y and its derivatives from (3) into (2), we get: 


œ 


> s(s— lags? + Sa > sa xt + > bas Sax? = 0. 

s=2 s=0 s= =o s=0 

By cross-multiplying the power series, collecting like terms and 
equating to zero the coefficients of the various powers of x on the 
left-hand side of the equation written, we get a series of equations: 


x| 2. lay + ay a, + By ay = 0 
21) 3-2a, + 2a) a, + a, 0, + bya, + b, a) = 0 
uw?) 4- ie ea ca ee a es ae Bt paca (4) 


as (8 + 2)(s us n "+Q, (To Ars Aa». 9 a0 


Here, Q5(a, @, az, ---, @s4,) denotes a homogeneous polynomial of 
the first degree in its arguments ay, a,, dg, ..-, A544. 

Each successive equation of the series written contains one more 
required coefficient than the previous equation. The coefficients a, 
and a, remain arbitrary, and play the part of arbitrary constants. 
The first of equations (4) gives a,, then the second gives ag, the third a,, 
and so on, so that in general, knowing the preceding ay, a, a», . 
as+ we can determine a,,, from the (s + 1)th equation. 


sty 
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It is convenient to proceed as follows. We find two solutions y, 
and Yy, by the above method, with the values a, = 1 and a, = 0 
taken for the first solution, and values a, = 0 and a, = 1 for the 
second, this being equivalent to the following initial conditions: 


Yı ean =1; Yi leat = 0, 
Yalxeo = 93 Ys |x=0 = 1. 


Every solution of the equation will be a linear combination of 
these solutions, and if the initial conditions have the form 


y Ix=0 = A; y’ |x=0 = B, 
obviously, 
y = Ay, + Bys. 


We have shown above that the coefficients of power series (3) can 
be determined successively by formal computation. It remains an 
open question, however, whether the power series thus obtained will 
þe convergent, and whether it will provide a solution of the equation. 
We give the proof in Volume III of the following proposition: if the 
series 


pl) = Xass; g(a) = S b,a 
s=0 


are convergent for |x| < R, the power series arrived at by the above 
method is also convergent for these values of x, and gives a solution of 
equation (2). In particular, if p(x) and q(x) are polynomials in x, the 
power series obtained is convergent for any x. 

Linear equations are often encountered of the form 


Po (x) y” + Py (a) y’ + Po(x)y = 0, (5) 


where P(x), P(x), P(x) are polynomials in æ. In order to reduce 
this to form (1), we have to divide both sides by P,(x), which means 
taking 
-_ Piz) , — _P2 (2) 
P(x) = P, (£) , q(x) P, (a) . (6) 


If the constant term of polynomial P (x) differs from zero, i.e. P,(0)40, 
p(x) and q(x) can be written as power series, on dividing the poly- 
nomials, arranged in increasing powers of x, and the solution of (5) 
also can be sought as a power series. With this, there is no need to 
reduce (5) to form (1), the simpler method being to substitute expres- 
sion (3) directly for y in the left-hand side of (5) then use the method 
of undetermined coefficients. 
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We have so far only considered power series, arranged in positive 
integral powers of x. It is also possible to use series arranged in 
powers of (x — a). 

Ail that has been said above evidently applies to linear equations 
of higher order than the second, except that now, instead of the 
first two coefficients remaining undetermined when finding the 
solution as a power series, the number of undetermined coefficients 
becomes equal to the order of the equation. 

Given the non-homogeneous linear equation 


y” + p(x) y’ + g(x) y = f(x), 


where both the right-hand side and the coefficients are power 
series, its particular solution can also be sought as a power series. 

We notice one point about expressions (6). Let P(x) and Q(x) be 
two polynomials in x, where P(0) 4 0. The result of dividing the 
polynomials can be written as a power series, as mentioned above: 


ae = o text ene? +...; (7) 


but now the questions arise, as to whether the series on the right is 
convergent, and if so, in what interval; also, is its sum equal to the 
left-hand side of the equation? The answers to these questions follow 
quite simply from the theory of functions of a complex variable, 
which is described in Volume IJI. Here, we simply give the final 
result: the power series of (7) is convergent for |x| < R, where R 
is the modulus (or absolute value) of the root of minimum modulus 
of the equation P(x) = 0; furthermore, equation (7) applies for these x. 
One consequence of this is that, if equation (5) is integrated direstly 
with the aid of power series, the series obtained will in fact be con- 
vergent for |x|< R, where R is the minimum among the moduli 
of the roots of the equation P(x) = 0. 


We remark that, if the convergence of series (3) is proved within the interval 
(—R, +R), it follows directly from this that the sum of the series gives a solu- 
tion of the equation. To start with, we can find y’ and y” by simple term-by- 
term differentiation of series (3) [I, 150]. Then, on substituting the expressions 
for y, y’ and y” in the left-hand side of equation (2), we can multiply term by 
term the series for yf and y with the series p(x) and g(x), in view of the fact 
that power series are absolutely convergent [I, 137, 148]. Finally, by the choice 
of the coefficients a, from equations (4), we can cancel all the terms on the left- 
hand side of (2). 
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46. Examples. 1. We take the equation: 
y” — vy = 0. 
We get by substituting series (3): 
(2- la, +3-2a,2 + 4-342? +...) — lao +a, eu Ha? +...) =0, 


whence we find, on equating to zero coefficients of like powers of a: 


Having put a, = 1 and a, = 0, we obtain successively the remaining coef- 
ficients: 


1 1 
i Sa Sa = gg 


a, = a, = 0; 


l 


"= 213-5689” 


i.e. the only coefficients as differing from zero are those for which the subscript 
s is divisible by 3, so that we can write: 


1:4:7... (3k — 2) 
(3k)! 


gk.) = Azka = O and ay, = 


We have thus obtained the solution 


n 14-7.. (3k = 2) ak 
=1 (3k)! i 


y=l+ 
k 


We get the second solution by putting a, = 0 and a, = 1. It is easily shown, 
by the same method as above, that this second solution is 


n wr 2.5.8... (3k — 1) sk41 
nere ara 


The power series obtained are convergent for all x. 
This may be verified for y, by using d’Alembert’s test [I, 121]. The ratio of 
the two successive terms becomes 


1-4-7.. (3k +1) sepa, Le4+7...(8R—2) ok 1 = 
Gk 3)! ate (3k)! T = (Bk £2) (8k +3)’ 


and the absolute value of this ratio tends to zero for all x on indefinite increase 
of k, whence follows the absolute convergence of the series. 
2. We take the equation: 


(1 — a) y” — ay’ + a? y = 0. 
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On substituting series (3) and equating to zero coefficients of x", we obtain 
the following relationships between the a,: 


(n+ 2) (n +l) ang — n(n — lb an — na, + aa, = 0 
or 
(n + 2) (n + 1) an; = (n? — a?) ap. 


On setting a, = 1 and a, = 0, we get the solution: 


a? (a? — 4) za a? (a? — 4) (a? — 16) 
4! 6! 


2 
n=l- t+ wet... 


Similarly, setting a, = 0 and a, = 1, we get: 


a —] a? — 1) (a? — 9) 
Y= m et i 


(a? — 1) (a? — 9) (a? — 25) 


x5 


The coefficient P(x) in the equation taken, where P(x) = 1 — %?, has 
roots x = +1, the absolute values of both these being unity. Hence it follows 
that the series for y, and y, must be convergent for —1 < x < +1, that is, 
for |z| <1. 

We can verify this with d’Alembert’s test. Neglecting the sign, we get for 
the ratio of two successive terms say of y: 


a? (a? — 4)...[a? — (2n)*] qnt. a? (a? — 4)...[a? — (2n — 2)?] z" 
(2n + 2)! i (2n)! 
a? — (2n)? 


npin t ™° 


We divide numerator and denominator by n?, which enables us to write 
the absolute value of the ratio in the form: 


ae | er 
ener anes 

This ratio tends to | x |? on indefinite increase of n, and obviously, | x |? < 1 
for |x| <1; thus, by d’Alembert’s test, series y, is absolutely convergent for 
|z| <1. It is also clear that the series is divergent for |x| > 1, provided 
a is not an even integer. In this last case, the series breaks off and reduces. 
to a polynomial. Similar conclusions apply regarding series y,. It can be verified 
that solutions y, and y, are expressible in terms of elementary functions, in 
fact, 


L's 
Yı = COS (a arc cos g); Yz = Ga sin (a are cos 2). 
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47, Expansion of solutions into generalized power series. Quite a number 
of equations encountered in applications have the form 


x? y” + p(x) - ay’ + g(x) y = 0, 


where p(x) and q(x) are series, as in equation (2), arranged in positive integral 
powers of x, or else are polynomials. This equation does not come under type 
(2), due to the presence of x? as a factor of the second derivative. The equation 
is said to have a regular singular point at x = 0. We write p(x) and q(x) explicitly 
as series: 


ary” + (a, +a, tar +...) ay’ + (bo +b,44+b,a%+...)y=0, (8) 


and instead of seeking the solution as the simple power series (3), look for it 
as the product of some power of x and a power series: 


y= 2? X ae’, (9) 
s=0 
Here, we can obviously take the first coefficient a, as non-zero, in view of the 
power ọ not being fixed in the factor outside. 
We substitute in the left-hand side of (8) the following expressions for 
Hy, y”: 


y= =, asats; y= 2, (e + 8) a, ze t51; 
s= 


s= 
= 2, (e + 8) (e +s — 1) ap 22t, 


On collecting like terms and equating to zero the coefficients of the various 
powers of x, we get the series of equations: 


ae |[e(e—1) + ase + by] a = 0 
zeti) f(g +1) e + a (e +1) + bo] a, + a, Ody + b, a =0 
- aor? [(@-+2)(@-+1)-+49 (@-+2)-+09] a.+@, (0+1) a, +a; Gag b; a,b, a00 i 


i bo bl‘l‘n ‘l ll ‘l l‘ ll‘ ‘ ‘Ml nl ‘ll‘ll‘ 


M e E E e E E PE E E e E E E E A N E E S O E E E E E S S 


The Qs(ao, Q, Gz, ...,4;_,) denote homogeneous polynomials of the first 
degree in their arguments ao, Ay Qz, ..., Agre 
Since a, Æ 0 by hypothesis, the first of the equations written gives a quadratic 
equation for ọ: 
F(2) = e(e — 1) + a e + ba = 0. (11) 


This is called the indicial equation. 

Let its roots be ọ,, 0} On setting either of these for ọ in equations (10), we 
get a series of equations, any given one of which contains one more coefficient 
as than the preceding equation, so that we can find successively u,, az, ... The 
coefficient a, which remains arbitrary, plays the role of an arbitrary factor 
and can be taken say equal to unity. 

The first of equations (10) has in fact become an identity after substituting 
e = 0, OF ọ = Oz, whilst the second gives a,, the third a,, etc., and in general 
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the (s + 1)th gives a, assuming ay, a,,..., @,_, are already known. The only 
requirement here is for the coefficients of the a, in the equations to be non- 
zero. It is immediately evident that these coefficients can be found from the 
left-hand side of equation (11) by substituting (eg, + $) or (e, + 8) for a, i.e. 
they are equal to F(a, + s) or F(o, + 8). 

Suppose that we have obtained solution (9), starting from the root ọ = p, 
of equation (11). If F(g, + s) # 0 for any positive integer s, the above method 
can be used successfully for determining the coefficients. 

The condition F(ọ, + s) # 0 is obviously equivalent to the other root 9, 
of equation (11) not being of the form (g, + s), where s is a positive integer 
In other words, (9, — 0») must not be a positive integer. 

The following conclusions are easily drawn from what has been said. 

1. If the roots ọ, and g, of equation (11) do not differ by a positive integer 
or zero, both roots can be used in accordance with the above method to obtain 
two solutions of the form 


y= a Y ag’; Ya = 2% X Bex? (ao and B, # 0). (12) 
s=0 s=0 


2. If (o, — o) is a positive integer, the above method can be used in general 
to form only one series: 


waar Y aa (13) 
s= 


3. If equation (11) has a repeated root, ọ, = 0, again, only the one series 
(13) can be formed. 

In view of the convergence of the series obtained, a proposition can be stated, 
similar to that made in [45]: if the series 


œ 


> ae and X ba 
s=0 s=0 


are convergent for |x| < R, the series formed above are convergent for the same 
x and represent solutions of equation (8). 
The equation 


x?P, (x) y” + xP, (x) y’ + P, (x) y =0, (14) 


reduces to the one worked out, where P,(x), P(x), P(x) are polynomials or 
series expanded in positive integral powers of zx, and P,(0) # 0. Here, as in 
[45], series (9) can be substituted directly in the left-hand side of (14), without 
dividing by P,(x). Furthermore, as in [45], we can take series expanded in 
positive integral powers of (x — a) instead of x. 

In case 1 above, the two solutions (12) are linearly independent, i.e. their 
ratio is not a constant, as follows at once from the fact that expressions y, 
and y, contain different powers of x, x?! and 2, before the summation sign. 
We have only found one solution (13) in cases 2 and 3. Formula (9) of [24] 
offers the possiblity of finding a second solution with the aid of a quadrature. 
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We merely state the result, without proof: if (@, — 0z) is a positive integer or 
zero, we have a solution, in addition to (13), of the form: 


Yo = Py, logs + 2 X pz. (15) 
s=0 


Expression y, thus differs in the present case from the ordinary expression 
(12) in having an additional term of the form fy, log x. It may happen that 
the constant £ is zero, in which case an expression of the form (12) is obtained 
for y,. All these assertions will be proved in Volume III. 


48. Bessel’s equation. This equation has the form: 
xy” + ay’ + (2? — p?) y = 0, (16) 
where p is a given constant. It occurs in various problems of astronomy and 
of pure and applied physics. 


Comparison of the equation with (8) shows that here, a, = 1 and b, = —p?; 
the indicial equation therefore now becomes: 


e(e@-1)+e—p?=0 or g—p?=0, 
with the roots 
0 =P, b= P. 
We seek the solution in the form 


y = aP (ag + aye + aye? + ...), 


On substituting in the left-hand side of (16) and equating to zero the coef- 
ficients of the powers of x, we get: 


att | [(p + 1)? — p?] a, = 0 
Pts [(p + 2)? — p?] a, + a =0 


We write a, = 1 then evaluate the coefficients successively, and arrive 
at the solution: 


a? at 


~ 2Qp +2) 242p F 2) (2p T3) 


xe 
se a 17) 
2-4-6 Op + 2 2p + I p F 6) ; 

A second solution of (16) can be obtained by making use of the second root 
0, = —p, and it obviously follows by the simple substitution of (—p) for p in 
solution (17), since (16) only contains p? and remains unchanged on making 
this substitution: 


1 — of 


a at 


2 (= 2p $2) 12-4. (— Bp +B (— Bp +4) 


Y= aP fı 


(18) 


8 
niece pipet}: 
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The difference between the roots of the indicial equation is 2p, so that both 
the solutions written are satisfactory, provided p is neither an integer nor half 
an odd integer. Solution (17) gives the Bessel function of the pth order, except 
for a constant factor; the Bessel function is usually written J,(x), and is also 
called a cylindrical function of the first kind. Thus the general solution of 
equation (16), with p neither an integer nor half an odd integer, is 


y +0: dp (x) +0zJ-p (2). 


The power series appearing in solution (17) is convergent for any x, as may 
easily be verified by d’Alembert’s test. 

Now let p = n be an integer (positive). Solution (17) remains in force, whereas 
solution (18) becomes meaningless, since one of the factors in the denominators 
of terms of the expansion vanishes as from a certain number. The Bessel function 
J,,(x) is defined for positive integral p=n with the aid of formula (17) multiplied 
by the constant factor 1/(2”+n!): 


at x? , xit 
In 2) = 5a |! TA " 2-4 (2n + 2) (2n + 4) 


as 
~ 2-4°6(2n F 2) En + 4) Qn + 6) tal (19) 


The general term in this expansion is 
gt 2s 


2n! 2-4-6 ... 28. (Qn + 2) (2n + 4) (2n + 6)... (2n + 28) ` 


(— 17 


Each of the 2s factors which appear in the denominator after 2" +”! con- 
tains the factor 2; if we take these out and combine them with the 2", we can 
write the general term in the form: 


gtts 


1/5 
) 9 12-3 ..08 (m4) (wd) 43) (w 8) 


( 


so that formula (19) can be written as 


Ihara ews (ey (20) 


Ko nts)! (2 
where we take 0! = 1 as usual. In particular, we get for n = 0: 
-y E oe 
Joo) = Bae >) = 


ay (a) T a (3) aa (3) otas R) 


By what was said in [47], equation (16) will have a second solution of the 
form: 


Kp (x) = BJ, (x) log x+" =, pa’. (22) 
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in addition to solution (20), when p = n is a positive integer. 
This solution obviously tends to infinity for x = 0. 
The general solution of equation (16) becomes for p = n: 


y =0 Jn (£) + CK, (2). (23) 


If we want to obtain a solution which is finite at x = 0, we have to take con- 
stant C, = 0, i.e. we must confine ourselves to solution (20). 
We take a closer look at solution (22) for p = 0. Here, the equation becomes: 


a ee 
toy +y=0, (24) 
and one of its solutions is given by (21). A second solution can be sought in the 
form 


BJ, (x) log x + By + Bye + Bar? +... 


On taking a linear combination of this solution with that already found, 
we can reduce the free term f, to zero, so that the final solution can be sought 
as 


BJ, (x) loga + bix + Bya?+... 

By substituting this expression in the left-hand side of (24) and using the 
method of undetermined coefficients, we can successively determine the bp 
We miss out all the working and only give the final expression for the second 
solution. We set the non-vanishing coefficient B equal to unity here: 


2 4 1 as I I 


This is called a zero order Bessel or cylindrical function of the second kind. 

Finally, let p be half an odd integer, p = (2n + 1)/2. Although the dif- 
ference between the roots of the indicial equation is here the integer (2n + 1), 
solutions (17) and (18) both remain in force and are linearly independent, 
inasmuch as we have the factors #@"*D/2 and a @"+1)2 respectively in front of 
the power series, so that their ratio cannot be constant. 

If we substitute p = 1/2, for instance, in solution (17), we get the series: 


a xi xs 
1/2 i a — 
q fı 2.37 1.3-6 nen a RARI 


1 x3 x5 a7 sin x 
-[-ate oat |= Ya ` 


On multiplying this series by the constant factor Y2jm, we get the Bessel 
function Jy. (x): 


2. 
Jux) = yg Sine. (26) 


Similarly, (18) gives us 


Jip (4) = (2 cos x, (27) 
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and the general solution of equation (16) for p = 1/2 is: 


y =o 0, Jij (x) + 0, Ja (x). 


We note some results without proof: the Bessel function with subscript equal 
to half an odd integer is in general expressed in terms of elementary functions 
and has the form: 


Jona (2) = j= |P, (=) sin s+ Qna (=) cos x | ; 
2 


where P,,(1/x) and Q,,(1/2) are polynomials in 1/x. In particular: 


Jara (w) = J+ == — cos z) ; 


ML a 


J sie @-~2(EG = 1) sin x — cos | 


2 3). 3 
Jag (x) = |2 [bone + (= = 1) cos a] 7 
Furthermore, we have for any positive integral n: 
a 2n+1 


n ——— a” . 
Ime Ee E i (E) 
2 


The even function (sin x)/z has to be differentiated n times with respect to 
æ? in this formula. 


49. Equations reducible to Bessel’s equation. We notice some equations 
that can be reduced to Bessel’s equation by a change of variables. Take the 
equation: 

ay” + wy’ + (ka? — p?) y = 0, (28) 


where k is a non-vanishing constant. We take instead of x a new independent 
variable = kx. We now have to substitute as follows in equation (28): 


,_ dy _ dy dé _, dy Pre: dy) _ 1, dy 
Y =a db * de a v= (t Ge) oa 


so that (28) becomes: 


k2 x? -gg + (atp) y=0 
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or 
Py g Y 
dé? ' dé 
which is Bessel’s equation (16) with the independent variable & Using ¿£ = ka, 
the general solution of equation (28) becomes: 


y =O; Jp (kee) + Oy J» (kee), (29) 


£2 


+ (E — p’) y =0, 


or, if p = n is a positive integer or zero: 
y =C, Jn (kz) +0, Ky (ke). (29,) 


A broad class of equations reducible to Bessel’s equation is revealed by intro- 
ducing a new independent variable ¢ and a new function u into (16), in accord- 
ance with 

y=ttu and c= yf, (30) 
where a, 8 and y are constants, and 8 and y are non-zero. Differentiation gives 
us at once: 


d 1 ps. dy 1 p-e dy 


dæ By ; da By dt 
d2 1 a a 1 p~e d2y 4 1— 8 1—8 dy ), 
dz? By By di? By dt 
and further, 
dy _ «a du ge dey a dtu 
dt! ae a : dt? de t 


+ 2al! La -q tea Dru 


If we substitute the expressions for y, dy/dx, d?y/dx? in equation (16), then 
replace dy/dt and d*y/dé? by their expressions in terms of u, du/dt, d?u/dt? 
and make simple rearrangements, we find the equation for u: 


a + (a + hy -2 U (a2? — pep + By? 2) u = 0. (31) 
Equation (16) has the general solution: 
y =O, Jp (2) + Cy S_p (x), 
so that, by (30), the general solution of (31) will be: 
u =t" y =O, t" J p (yt) +O, 6-9 Jp (yt), (32) 


where J_,(ytg) must be replaced by K,(yt®), if p =n isa positive integer 
or zero. 
Equation (31) has the form: 
2 
poe at SY + (b+ ct") u=0, (33) 
where 


2Qat+l=a; @—-Ppr=b, Pyr=ac; 2B=m. (34) 
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Given an equation of type (33), with constants c and m not zero, we can 
conversely find a, £, y and p from equations (34) and write the general integral 
in terms of Bessel functions by using (32). 

If c or m is zero, (33) is Euler’s equation [42] and simply reduces to an equa- 
tion with constant coefficients. 

We consider a particular case of equation (33): 


du 
t +a Ji - tu = 0. (35) 


Multiplication of the equation by ¢ shows that we have here: b = 0, c = 1, 
m = 2, and a arbitrary. Equations (34) become: 


2a +l =a; a? — f? p =0; Pty?=1; 28=2, 


whence we have 
a—l . a— 1l 


and the general integral of (35) becomes, by (32): 


1—a l—a 


u=0,t? Jaat) +0: ? Ji alt) 
2 ae 
where we have to replace Jq—y_ by Kg—yy2 if (1 — a)/2 happens to be a 
negative integer or zero. Equation (35) is identical with (24) when a = 1. 
Equation (33) represents a wide, general class of linear equations that are 
often encountered in applied mathematics, with general integrals expressible, 
as we have seen, by means of Bessel functions. 


§ 5. Supplementary notes on the theory 
of differential equations 


50. The method of successive approximations for linear equations. 
Mention has already been made several times of the existence and 
uniqueness theorem for differential equations. We first prove the 
theorem for the case of linear equations, and do so by making use 
of the method of successive approximations, which we introduced 
originally for the approximate evaluation of the roots of equations 
[I, 193]. 

We take for clarity the system of two linear homogeneous equations: 


=A @ytae)e Ge=pleyteac)e 0) 


with the initial conditions 


Ylx=xo = Yo: A = Zo. (2) 
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We shall assume that the coefficients in equation (1) are continuous 
functions of x in a finite, closed interval J (a < æ < b) which con- 
tains the initial value x and we take x as belonging to I in future 
arguments. 

The solutions y and z of system (1) must certainly be continuous, 
differentiable functions, and it is clear from the equations themselves 
that the derivatives dy/dx and dz/dz must also be continuous, since 
the right-hand sides of the equations are continuous with the assump- 
tions made. On integrating the equations term by term from 2, to 2, 
and taking into account (2), we get: 


ule) =y + J [p Oy) + a (20) ar 
2(x) = zo + f [pa (t) y(t) + qa (t) 2(¢)] a| 


We have written out the arguments of functions y and z here for 
ease of working, the variable of integration being denoted by ¢ so as 
to avoid confusion with the upper limit æ. Equations (1) with initial 
conditions (2) thus bring us to equations (3). 

We now prove the converse: if continuous functions y(x) and z(x) 
satisfy equations (3), they also satisfy equations (1) and initial con- 
ditions (2). The latter follows by putting x = x, in (3) and recalling 
that an integral vanishes when its limits are the same. Furthermore, 
differentiation of (8) with respect to x gives us equations (1) [I, 96]. 
Equations (3) are therefore equivalent, in this sense, to equations (1) 
with initial conditions (2), and we shall in future consider equations 
(3) only. We notice that the required functions y(x) and 2(x) appear 
both on the left-hand sides of these equations and under the integral 
sign on the right-hand sides. 

The method of successive approximations works as follows. We take 
the initial values y, and z, as first approximations to the required 
functions y and z and substitute accordingly in the right-hand sides 
of equations (3); this gives us functions y(x) and 2z,(z): 


x 


} 
Yı (£) = Yo + f [p (t) Yo + q (É) zo] dt | 


Xo 


2, (a) = 2+ § [Pa (Ë) Yo + g (t) zo] dt, 


Xo 
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as second approximations to y and z. These latter functions are 
clearly continuous in the interval J [I, 96]. We now substitute y(x} 
and 2,(z) for y and z in the right-hand sides of (3) and obtain third 
approximations y,(z) and 2z,(x): 


Yo (x£) = Yo + f [p1 (t) y (t) + g (t) z2 (t)] dt 


za (x) = zo + § [palt)yr(t) + Ge (t) z1 (O] dt, 


where y (x) and z,(x) are again continuous in the interval I, and so on. 
In general, the (n + 1)th approximation will be given by: 


a ) 

Yn (%) = Yo + f A E) Yn—1 ($) + Q (É) 2n—-1 (t)] dt | 
7 (5) 

2, (£) = Zo + f [Pe (t) Yn—a (E) + ga (t) Zn (t)] dt. | 


The coefficients of equations (1) are assumed continuous in J, so that 
their absolute magnitude will not exceed a definite positive number 
M in the interval [I, 35]: 

|p| <M; |po(x)|<M; |qo(u)| <M (win I). (6) 


l 


Also, we let m denote the greater of the two positive numbers 
| Yo | and | 2, {, i.e. 


(Yol <m; |z| <m. (7) 


We shail in future consider only the part of I to the right of 2p, 
ie. we take x — x, > 0. The left-hand part can be considered in the 
same way. 

We find the difference between two successive approximations. 
The first of equations (4) gives: 


x 


Yı (x) — Yo = f [p (t) Yo + 9: (t) zo] dt . 
iXo 
We replace all the magnitudes under the integral by quantities 
greater than or equal to their absolute values and find, using (6) 
and (7) [I, 95]: 
x 
[Y (#%) — Yo] < | (Mm + Mm) dt ; 


Xo 
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i.e. 
|Y (£) — Yo | < m: 2M (x — ay), (8) 


whilst similarly 
| zı (x) — zo| < Mm- 2M (x — xo). (8) 


The first of equations (5) becomes, for n = 2: 
x 
Yo (x) = Yo + f [p; (t) yi (t) + q (4) 21 (t)] dt, 
Xo 


and we have, on subtracting the first of equations (4): 


| Ya (x) — y, (x) | = f {p (4) [y (4) — Yo] + qi (4) [ % (t) — (zo]} dé. 
If we again replace the magnitudes under the integral with their 
absolute values and use (6), (8) and (8,), we get: 
1Y2(#)— y()|< f {M-m-2M (t — x)}+{M-m-2M (t—x,)! dt 
or 


| Yo (x) — y (x) | < 22mM?2 f (¢— Xo) dé = m-22M2 [É e 


3 
t=x, 
Xo 0 


whence finally, 


| ye (2) — yı (2) | < m PEZA a 


Similarly, we have: 


mM ea)? 


r- (9) 


| 22 (x) — 2 (£) | < 


We continue by taking the first of equations (5) for n = 2 and 
n = 3, and find on subtraction: 


ya (2) — ya (2) = F {p0 [0 — y 0] +a O [ead — 2 O} dt. 
We have by using (6), (9) and (9,) as above: 


iya) — yle) |< 2R f E tde 


whence 


lys (2) — ya (x) | < m BHE =F , 
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and similarly, 


2M (z — 3 
| 23 (£) — 2 (a) |< m EM (= el d 
By proceeding thus, we can write the difference between two 
successive approximations as in general: 
2M (x — k 
[Yn (2) — Yra (2) | < m EET 
[2M (z — %)J” uo 
| 2n (2) — zn (@) | < m ERE Aol 
We use these values of the differences to show that y,(x) and 
2n(z) converge uniformly to limits y(x) and 2(x) respectively on 
indefinite increase of n.t We give the proof for the sequence of functions 
yn(x). We can replace this sequence by the infinite series 


Yo + [Y (£) — Yo] + [u (x) — Yı (x)] + 
F [ys (x) — Yz (x)] F mig F [Yn (x) — Yn-1 (x)] F ...3 (11) 


the sum of the first (n + 1) terms of which is equal to y,(x), so that 
we have to prove the uniform convergence of series (11) [I, 144]. 
If l is the length of the interval J in which æ varies, the first of 
expressions (10) shows that the terms of (11) do not exceed in absolute 
value the positive numbers 
(2M1)" 


n! 


m (n=1,2,...), 


whilst the series consisting of these numbers converges by d’ Alembert’s 
test, since the ratio of two adjacent terms is 2MI/n, which tends to 
zero with indefinite increase of n. The same result follows from the 
expansion of e” [I, 129]. Series (11) is therefore uniformly convergent 
in the interval I by Weierstrass’s test [I, 147], i.e. y,(x) tends uniformly 
in this interval to some function y(x). Similarly, it can be shown 
that sequence z,(x) tends uniformly in J to some limit function z2(x); 
thus we have, for x in I: 
lim y, (x) = y (x); lim z, (x) = 2z (x). (12) 
n>o n> œ 
Since y,(x) and z,(x) are continuous in J, the same can be said for 
their limits y(x) and z(x) [I, 145]. 


+ It is essential to recall, for what follows, the sections on series with variable 
terms and uniform convergence in Volume I. 
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We remark that (x — x) must be replaced by (x, — x) in the 
right-hand sides of inequalities (8) and (8,) when dealing with the 
part of I to the left of x, where æ — z < 0. We have to replace 
(t — £o) by (£o — t) in the further working, and so on. Inequality 
(10) remains valid throughout J, assuming (x — x) is replaced by 
its absolute value. 

We now show that the limit functions satisfy equations (3), i.e. 
equations (1) with initial conditions (2). This follows immediately 
from expressions (5), on passing to the limit with n— œ on both 
sides of the equations. Now, y,(x) and yp_,(t) tend respectively to 
y(x) and y(t), whilst z,(xz) and z2,,(f) tend to z(z) and z(t), so that 
we get equations (3) for y(x) and 2(x) in the limit. We give a rigorous 
proof of the passage to the limit. It follows from (12) that: 


a [p (t ) Yn- (t) + Q (t)2n—a (6)] = P: (E) Y () + 9, (6) 2 (8); 


Tim [aO 0+ Oam ODD f 7) 


We show that these passages to the limit occur uniformly with 
respect to ¢ in the interval I. We confine ourselves to the first expres- 
sion and work out the difference between the limit and the variable: 


| [p (2) y (t) +a (8) 2 ©] — [pr © Yn () + A (H Zn- H| 

< |p, (t) Ily @) — Yn- (t J| + lg Hl (6) a z (t) — Zn-1 (t) |. 
Since Ynt) and 2,-,(¢) converge uniformly to y(t) and z(t), given 
any «> 0, there exists a number NV, the same for all ¢ in J, such 
that 


WO — Yn O< r O a Ol gr for n>N. 
Hence it follows, by (6), that we have the inequality, for any ¢ of I: 
ED y(t) + galt) 2] — PLE) Yni) + anle for n>N, 


which proves the uniform passage to the limit in formulae (12,) 
throughout the interval J and in any part of it (£o x). We turn back 
to expressions (5) and use the fact that passage to the limit under 
the integral sign is possible for uniformly convergent sequences [I, 145]; 
having passed to the limit, we get equations (3) for y(x) and z(x) 
from these expressions. 

To sum up, we can say that the method of successive approximations 
has given us a solution of system (1) with initial conditions (2), i.e. 
we have proved the existence of a solution. We now show that the 
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solution is unique. Let equations (3) have two solutions: y(x), 2(z) 
and Y(x), Z(x). On substituting in (3) first one solution, then the other 
and subtracting, we get: 


x 


ylz) — Y (a) = f {pr (9 Ly) — YO] + p (8) [e(t) — Z)]} dt 


Xe 


ax) — Zle) = § {Ipa (y(t) — Y ()]-+ pa (0) [elt) — Zo] ae. 


(13) 


We take an interval I, of length } to the right of xo, so that 2M1, = 0 
is less than unity. We show that the two solutions coincide in this 
interval. If this were not the case, the absolute values of the differences 


ly(z) — Y (x)|, |a(x) — Z(x)| 


would have a positive maximum in J,, which we denote by the num- 
ber 6. Let the maximum be attained by the first difference at 
a= &, ie. 

ly(é) — Y(&)| = ô (14) 
and 


ly(w) — Y (m)| < ô and |e(x)—Z(a)|<5 (win I). (4) 


We take the first of equations (13) for x = €, and apply the same 
inequality for the integral as above; this gives us, by (14): 


ly(&) — ¥(€)| < 2M6 (£ — ap), 
whence, using (14) and the fact that & belongs to interval Z”, 
6<2M1,6, ie. ô< 66, 


which is impossible, since 0 < 0 < 1 by hypothesis. 

Our assertion that the solutions y, z and Y, Z do not coincide in 
the interval J, is therefore absurd. We can cover the total interval I 
by intervals of length l, and thus prove the identity of the two solu- 
tions throughout T. 

We state the final result: system (1) with initial conditions (2) has 
a single definite solution which exists in the interval I of continuity of 
the coefficients of the system; also, the solution can be obtained by the 
method of successive approximations. 

This result is also valid in the case when I is an open interval 
c<2«<d, since, by what has been proved, we have the existence 
and uniqueness of the solution in every finite closed intervala < xz < b, 
which contains the initial point x, and lies inside the interval c < x < d. 
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We might equally have considered a non-homogeneous system, 
i.e. with the extra functions f(x), fa(x), continuous in the interval J, 
added to the right-hand sides of equations (1). The foregoing proof 
remains in force here. 

The linear equation of the second order: 


y” + p(z)y’ + g(a) y=0 (15) 


can be written in the form of a system, on introducing an unknown 
z = y’ in addition to y: 


d d 
Gh =z; SS = — ple) z — gle) y, 


The result stated above is therefore also justified for (15), with the 
initial conditions 
Ylcaxo = Yo? Y'xaxo = Yor (16) 
in the interval I of continuity of the coefficients p(x), q(x). 
We can use conditions (16) to re-write equation (15) as 


x x 

Y =Y + mr — f de f [p(x)y’ + g(x) y] dz, (17) 
Xo x} 

where we can replace the double by a single integral in accordance 

with expression (23) of [15]. Equation (17) makes it possible to apply 

the method of successive approximations to (15) without reducing 

the equation to a system. 


Example. We apply the method of successive approximations to the example 
of [46]: 
y” — xy =0, 


We take the initial conditions y|,x_) = 1 and y’ |.) = 0. Equation (17) 
now becomes: 


x x 
y=1 + f dæ fay dz. 
ò ò 


On substituting y = 1 on the right-hand side, we get the second approxi- 
mation: 
x 


x 
x3 
n (2) =1+ faz fzaz=1 + 37.3 
o 0 


The third approximation is: 


xs 


0 0 
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On passing to the limit, we clearly get the power series: 


za 


xs + 


w+... 


which we had in [46]. 


51. The case of a non-linear equation. The existence and uniqueness 
theorem may also be proved for non-linear equations by using succes- 
sive approximations, though the final result is stated somewhat 
differently. We shall take for 
simplicity a first order equation: Y 


y’ = f(x,y) (18) 


with the initial condition: 


res ae 


(Xo Yo) 
oe = Yo (19) 


We suppose that the given 
function f(x, y) is continuous in 
the neighbourhood of the initial 
point (zo Yo) and has a bounded 
derivative with respect to y in this neighbourhood. More rigorously, 
a rectangle Q exists in the zy plane (Fig. 27): 


Fia. 27 


po - AX T +a; | (20) 


hey oe 


in which f(x, y) is continuous, has a partial derivative with respect 
to y, and where 


tw) <k, (21) 


k being a definite positive number. We can show, as in the case of a 
linear equation, that (18) with initial condition (19) is equivalent to 
the equation: 


Y = Y + f ft y(t)] dt. (22) 


We assume here that the interval of variation of z does not exceed 
(to — 4a, £o + a), whilst the value of the continuous function y(x) 
does not lie outside (Yyọ— b, Yọ + b), i.e. we assume that the point 
with abscissa x and ordinate y(x) belongs to the rectangle Q. 
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Subtraction of successive approximations will lead to expressions 
similar to (4) and (5): 


Yı (2) = Yo + Ñ Ht, yo) at. 285 
p (23) 
Yn (x) = Yo F f fit, Yn-1 (#)] dt. 


We turn to condition (21). If two points (2,, y,) and (£i, Y2) of Q are 
taken with the same abscissae, we can use the formula of finite 
increments to write [I, 63]: 


f(y, Yo) — A Y) = lY — mil a hs 


where y, lies between y, and y, Condition (21) gives with this: 


fa Ya) — fle Y) < klyz — yl. (24) 


This inequality is usually called a Lipschitz condition and is used 
for proving the convergence of yn(x) and the uniqueness of the solution. 
Let M be the greatest absolute value of the continuous function 
f(x, y) in rectangle Q, i.e. 


zy <M [(x,y) in Q]. (25) 


In carrying out the subtractions of (23), care must first of all be 
taken that the points with abscissae x and ordinates yn(x) do not 
pass outside rectangle Q, as defined by conditions (20). The first of 
these conditions gives the inequality | x — x)| < a for x. The second 
leads to the inequality: 


Yn (%) — Yol < b. (26) 


For this inequality to be satisfied for any n, x has to be subjected 
to the condition | x — z, | < b/M, as well as | x — z£, | < a; thus we 
finally get for x: 


b 


T: (27) 


le — a <a; je z < 


We show that, with this, all the approximations satisfy inequality 
(26). The first of equations (23) gives 


y (2) — yo = § Hit, yo) dt, 
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and if we evaluate the integral as usual, using (25), we get: 


ys (x) — yo] < M |x — x], 


hence, by the second of conditions (27), | y (£) — Yə | < b, so that (26) 
is satisfied for n = 1. Furthermore, the function y(x) defined by 
the above expression is obviously continuous, given that conditions 
(27) are observed. Having noted these points, we can now pass on 
to find y(x) with n = 2 in (28): 


Y (x) — Yo = fient (t)] dż, 


whence, as above, 
b 
\Ye (x) — Y < M |x — zo| < M -r = 5, 


i.e. (26) is also satisfied for n = 2, and y,(x) is clearly continuous, 
given conditions (27), and so on. We can thus find successive approxi- 
mations y„(x) in the interval (x), — c, £o + c) where, by (27), c is 
the lesser of the two numbers a and b/M. Let us call this interval T. 
All the y,(x) are continuous in J, and in all future arguments we 
shall assume that 2 belongs to J. 

We now consider yn(£) — yn-,(x), and take x — x, > 0 for simpli- 
city, as above. By (25), the first of equations (23) gives: 


ly (x) — Yoj < M(x — a). (28) 


We take the second of equations (23) with n = 2 and subtract 
from the first: 


ya (2) — y; (2) = J (ft, ys (I — Ht, yo)} dt, 
whence [I, 95] 
lya(z) — yy (x <f FE, Y1 ()] — F(t, Yo)| dt, 
or, by (24): 
elo) =y o< f Rn (O — alae, 
We obtain further, on using hedig (28): 


| Ya (x) — yı (x) | < kM f (¢ — zo) at = kM [N l 


tx 
Xo 
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and finally, 
[ya (a) — y (2) | < kar E. (29) 


We now write the second of equations (23) for n = 2 and n = 3 
and find, on subtraction: 


ys (2) — yala) = § (E y ©] — [6 y: ©} ae. 


Xe 


This gives us, as above, on using (24) and (29): 
dye (2) — (2) | < eae ETE 


and by proceeding in this way, we arrive at the general inequality: 


M [k(x — %)]" 
ko mk 


n! (30) 


[Yn (x) — Yn-1 (x) | < 
If we replace (x — 2,) on the right by its absolute value, the inequality 
is valid for all x of Z. It follows from this inequality, as above, that 
in the interval I y,(x) tends uniformly with respect to x to a limit 
function y(x), which is continuous and satisfies inequality (26), 
ie. | y(z)— Yal <b. Hence, points with abscissae x and ordinates 
y(x) belong to rectangle Q. We have by the continuity of f(z, y): 


jim ft, Yn O= Fiy] (from 7). 


The uniformity of this passage to the limit with respect to ¢ in I 
is easily seen as follows. For any given positive æ, there exists a ô, 
by the uniform continuity of f(x, y) in Q, such that | f(x", y”) — 
— f(x’, y’)| < e if (x, y’) and (z”, y”) are points of Q for which 
|æ” — aw’ | <6 and |y” — y’ | < 6. Furthermore, since y,—,(t) tends 
uniformly to y(t), there exists an N, the same for all t of Z, such that 
| y(t) — Yn(t) | < 6 for n > N and all t of I. Hence it follows that 
for all ¢ of I: 


IFE y O] — F ft Yn OJI E n>N, 


which proves that the passage to the limit is uniform. We return 
to the second of equations (23) and pass to the limit on both sides 
with indefinite increase of n. We can pass to the limit under the 
integral sign, by the uniform convergence of f{t, Yn—(t)] to f[t, y(t)], 
and we thus get equation (22) for the limit function. 

The uniqueness remains to be proved. Let equation (22) have two 
solutions y(x) and Y (æ) in some interval (£o — d, x, + d) which lies 
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inside (z4 — 4, % + a), where d can be taken small enough for y(x) 
and Y(zx) not to lie outside (yọ — b, yy + b). We substitute first one, 
then the other solution in (22) and subtract: 


y (x) — Y (a) = f (fly | —s[t Y Oat 
whence i 
ly (x) —Y(x)|< i iey] — tii Y@]i|dé (t> a), 
and by (24): k l 


ly (æ) — Y (@)|<k f ly — Y de. 
Xo 


We take an interval of length l such that kl, = 0 is less than unity 
and show, as previously, that y(x) and Y(zx) coincide. Hence, with 
the assumptions made regarding f(x, y), equation (18) with initial con- 
ditions (19) has a definite solution which exists in the interval (x, — c, 
Ly + c), where c is the lesser of the numbers a and b/M, and which can 
be obtained bythe method of successive approximations. We notice that 
the definition of the interval in which x varies is more complicated 
for a non-linear equation than for a system of linear equations, where 
it is simply the interval of continuity of the coefficients. We explain 
the matter more precisely by an example. 


Example. We take the equation: 


yY=aet+y (31) 
with the initial condition 
Y lx-o =0. (32) 
Equation (22) becomes: 
x 
y= f [t+ y*(t)] de. (33) 
0 


We replace y(t) on the right by zero and find the second approximation: 


y, (%) = Í ta= = A 
We get the third approximation by substituting this in the right-hand side 
of (33): 


x 
tt 2 5 
Ye (2) = f brie 
0 


and so on. 
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We now consider how to define the interval of variation of x in which we 
may apply the method of successive approximations. The right-hand side of 
equation (31) is continuous and has a bounded derivative with respect to y 
in any rectangle drawn about the point (0, 0), so that the a and 6 in conditions 
(20) can be taken arbitrarily. With this, M = max | {x + 42| =a + 6%, and 
inequalities (26), defining the required interval of variation of x, become 


[s| <a; ieee. 
: a+ b 


If b is taken as either large or near zero, the second inequality results in 
a very narrow interval. The same is true if a is taken as large, whereas small 
a also gives a narrow interval, by the first inequality. It follows that we cannot 
have as large an interval as we please for xz, although the right-hand side of 
(31) has no singularities for finite x and y. 


52. Singular points of first order differential equations, If the right- 
hand side of the equation 


y =f (x,y) (34) 


is continuous and has a bounded derivative with respect to y at and 
in the neighbourhood of the point (xo, Yo), the existence and uniqueness 
theorem shows that one, and only one, integral curve passes through 
this point. A point at which the above conditions are not satisfied 
by f(z, y) is called a singular point of the equation, and the existence 
and uniqueness theorem is no longer valid at such a point. 

We re-write (34) in a form containing the differentials: 


dx __ dy 
P(x,y) Q(z, y)’ 
and for simplicity, we take P(x, y) and Q(z, y) as polynomials in x 
and y. If P(x», Yo) # 0, (35) can be written as 


(35) 


and given this condition only, the right-hand side is continuous at 
and in the neighbourhood of the point (£o Yo) and has a bounded 
derivative with respect to y, as found by the usual rule for differentia- 
tion of a fraction. The conditions for the existence and uniqueness 
theorem are thus satisfied at (£o Yo) if P(£o Yo) # 0, and one, and 
only one, integral curve of equation (35) passes through this point. 
If P(X, Yo) = 0 but Q(x, Yo) # 0, (35) can be written as 


da _ P(x,y) 
dy Q(z, y)’ 
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with x taken as a function of y. The denominator of the right-hand 
side does not vanish at (£o, Yo), and we can see, as above, that the 
existence and uniqueness theorem is applicable at this point. The 
singular points of equation (35) are thus the points at which P(x, y) 
and Q(x, y) vanish simultaneously, their coordinates being found as 
the real solutions of the system: 


P(a,y)=9; Q(x,y) = 0. (36) 


What has been said also applies to the case when P(x, y) and 
Q(z, y) are series, expanded in positive integral powers of (x — xo) and 
(y — Yo). If the constant term of at least one series differs from zero, 
the existence and uniqueness theorem is applicable at (x9, Yo); other- 
wise, we have a singular point of the equation. 

The idea of a singular point may be explained by the example 
of steady fluid flow [12]. Let P(x, y), Q(x, y) be the projections of 
the velocity vector v(x, y) on the coordinate axes. Equation (35) 
expresses the condition that the tangent is parallel to the velocity 
vector and, in fact, is the differential equation of a stream line. At a 
point where v(x, y) differs from zero, at least one of its projections 
P(x, y) and Q(z, y) must differ from zero, and one, and only one, 
stream line passes through the point, by the existence and uniqueness 
theorem. On the other hand, points where v(x, y) vanishes, i.e. 
where equations (36) apply, are singular points of (35) and are called 
critical points of the flow in question. A different situation is obtained 
here: stream lines can intersect at the point, approach the point 
asymptotically or form a closed curve round it. Singular points can 
thus be of various kinds, and it is important to know which kind 
when studying the motion (the integral curves of the equation). 
We deal with the problem in a particular example in the next section. 


53. The stream lines of collinear plane fluid motion. We take the 
particular case when the projections of the velocity are first degree 
polynomials: 

P(#,Y) = an8 + aytb; Q(x, yY) = age + awy + by ; 
the fluid flow being spoken of as collinear in this case. 
We start by assuming that the straight lines 
aut + aY +b =0 and ayt + ay + b = 0 (37) 


are not parallel. The constant terms b, and b, vanish, on taking the point 
of intersection of the lines as the origin. The equation of motion 
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becomes 
dx dy 


Te , 
aut + ary azt + Ogg 


(38) 


for which the origin, x = y = 0, is obviously a singular point. We 
show how the nature of the singular point is revealed by the form 
of the coefficients air. 

Clearly, (38) is a homogeneous equation and can be integrated by 
the method given in [3]. We use another method, however, which 
consists in introducing new variables € and 7, so as first to reduce 
(38) to a more convenient form for direct inspection. 

We put 

E= matny; n= M£ + my, (39) 
whence 
dé = m, dx + ndy; dy = m, dz + n, dy. 
We get from (38), on eliminating dz and dy: 


dé = dn 
My (OX F- yyy) + My (Ant + aY) Mg (An + a124) + Me (ant + azzy) ` 


(40) 


We now define the coefficients in (39) in such a way that the 
denominators of the fractions written are proportional to & and 7 
respectively. The first denominator gives us 


My (At + Ayo Y) + Ny (azt + aY) = o( Myx + nY), 


whence, on comparing coefficients of x and y, we obtain a system 
of homogeneous equations for m, and n: 


(Qi — o) mM + azn = 0 | (41,) 
aM, + (aa — 0) n = 0. 


Similarly, on equating the second denominator to oy, we get the 
system for m, and n: 


(aii — @) Mg + aono = 0 | (41,) 


Ay2Me + (aog — E) Nz = 0, 


where the coefficient of proportionality ọ will now have a different 
value. 

We cannot have m = n = 0, since the change of variables of (39) 
then becomes meaningless. Systems (41,) and (41,) must thus have 
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solutions differing from zero. But two homogeneous linear equations 


ax + piy =0; aw + By = 0 


have a solution differing from 2 = y = 0 when, and only when, the 
straight lines corresponding to the equations coincide, i.e. when 
their coefficients are proportional. With systems (41,) and (41,), this 
amounts to the condition 

anue On 


> 
Aig a22 — E 


leading to the quadratic equation for ọ: 


0? — (a11 + Aza) 0 + (Araog — Ay24q1) = 0. (42) 


With this, systems (41,) and (41,) both reduce to a single equation, 
and we can determine the solution differing from m = n = 0. 

We now consider in detail the various possible cases. 

(A) Equation (42) has two distinct roots p, and pz. 

On substituting @ = o in (41,) and g = g, in (41,), we can determine 
the coefficients in (39), as described above; equation (40) then reduces 
to an equation with separable variables: 


dé dy 


Ce oe (43) 


It may be shown that, with this, equations (39) are soluble with 
respect to x and y. 
We proceed by considering various particular cases of case A. 
(1) The roots p, and p, of (42) are real and have the same sign. 
Integration of (43) gives us: 


log + = log 7% + log O, 


where log O, is taken as the arbitrary constant. 
Hence: 


1 1 
a = Oe; = One (C= C0), 


or 
(myx + my) = C (mt + ney)er . (44) 


The fraction 0,/0. is positive in the present case, so that the coordi- 
nates x = y = 0 satisfy (44) for any value of C, ie. every stream 
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line (integral curve) has a singular point (Fig. 28). A point of this 
sort is called a node. 

(2) Roots o, and 0, real and of different sign. Here, 9,/ 0, is negative, 
and we write it as (— u), where u is a positive number; the general 
solution (44) now becomes: 


(m,n + x,y) (Mx + ny) = C (u > 90). (45) 


Fra. 28 Fie, 29 


We obtain C = 0 on substituting = y = 0, so that the equation 
of stream lines passing through the origin runs: 


(mix + niy) (Max + ny)” = 0 
corresponding to which we have the two straight lines: 
mz + n y= 0; 
My £ + ngay = 0. (46) 


It follows that in this case two and only two stream lines (integral 
curves) pass through the singular point, which is here referred to 
as a neutral or saddle point. The curves (45) are similar to hyperbolas 
for values of C differing from zero (or actually are hyperbolas for 
u = 1), whilst the straight lines (46) are their asymptotes (Fig. 29). 

(3) Roots pọ, and go, complex conjugates, with non-zero real parts: 


Q=a+ fi; eg =a—fi (a and B40). 


If we substitute conjugate values of 9 in the coefficients of systems 
(41,) and (41,), we get systems whose corresponding coefficients are 
conjugates. Replacing i by (—7?) in any solution m,, n, of one system 
therefore gives us the solution m,, n, of the other system. It follows 
from (39) that & and 7 can also be taken as conjugates: 


E= b tmi n= mi, 
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where & and y, are real polynomials in x and y of the form 


f= pt H Hy; M = Pett gey. (47) 
Equation (43) becomes: 
dé, + idm dé, — idy, 


(@+Bi)(&+mi) (a — pE mi) 
or 


dé, + idn, dé, — i dm 


(a$, — $m) + (BE, + an) i =< (a$, — Bn.) — (B&, Ham) i ` 
We obtain in the usual way, on forming new proportions, by first 
adding numerators and denominators, then subtracting them: 
då = dn, 
af, — fm BE, + an ’ 


whence 
éd, + m dn, = + (êd, — m då) 
or 
& dë, +mdmn _ a 1 | mdm — mdk 
i+ ni P jp% éi A 
On writing 
RE >- 2. — Mi 
u = 1 + ni ; v= E , 
1 
we get 
du d 1 
a = F Tro ; -z Ogu = -7 aro tan v + log C 
and the general solution is therefore: 
2.72 — 4 n zI y2 goretan 
log t +i = -p aro tan = + log C or Vë -+ ni = Ce : 
1 
(48) 
We obtain 
2a 
r= 0e? j 


on taking polar coordinates ,;=rcos@ and m =rsin0 in the 
(či m) plane, i.e. the stream lines are logarithmic spirals in (§&, 7,) 
coordinates, approaching the origin from the same direction [I, 83]. 
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The spiral forms will be maintained on passing to the original (x, y) 
coordinates, related to the (é, m) coordinates by (47). It follows 
that no stream line passes through the singular point in the present 
case, whereas they all approach it asymptotically in spirals (Fig. 30). 
A singular point of this kind is called a focus. 


Fie. 30 


(4) Roots o, and p, pure imaginary (+f i). We obtain on putting 
a = 0 in (48): 


éi + nt = C, (49) 
or in the original coordinates: 
(Pit +414)? + (Paz + qy)? = C?. (50) 


We get similar ellipses instead of circles (49). None of the stream 
lines (integral curves) pass through the singular point in this case; 
they form closed loops about the point (Fig. 31), instead of spiralling 
round it as in the previous case. A singular point of this sort is called 
a centre. 

(B) Equation (42) has a double, non-zero root p, = p, Two cases 
are possible on substituting @ = ọ, in the coefficients of systems 
(41,) or (41,): either all the coefficients vanish, or there is at least 
one coefficient differing from zero. We have in the first case: 


Qiz = Ay, = 0; Qi = ly = P), (51) 
and system (38) takes the form: 


dx __ dy x dy 
QT Qay ©. Y 


its general solution is a family of straight lines passing through the 
origin, i.e. the origin is a node. 
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At least one of the coefficients 


Qiz A21 Ay, — li Aza — Oy 


differs from zero. It is easily shown that a,, and a, cannot both be 
zero here; for, if ai = aa = 0, we get @,, = üy = 0, by the fact 
that p, is a double root of (42); equation (42) must reduce to 9? — 
— (du + a») @ + Q a. = 0 on the assumption made, and the con- 
dition for a double root gives a,, = ay from this equation, the common 
value of a, and a. being the value of the double root. Conditions 
(51) are therefore satisfied if we take a, = ay = 0, and this contra- 
dicts our hypothesis. Thus at least one of the coefficients a), or a, 
differs from zero. Suppose, say, @,, # 0. The double root of (42) is 
now clearly: 
a= aus + "e, 

and on substituting @ = ọ, system (41,) must reduce to a single 
equation, as remarked above: 


Qi — Gre poe 
Su A m, + dy, n = 0. 
We take m, = an and ny = — (a4, — G)/2, i.e. 


air — Gag 


E = ayt — Y, (52) 


and retain y as the second variable. The differential equations can be 


written as 
dé dy 


QF Ont + ary 
or, on replacing æ by its expression in accordance with (52): 


aÈ _ __ dy 
ai E+toy 


On introducing a new variable ¢ instead of y: 
y = tE, 
we arrive at the equation 
1 d 
NE = dt, 
and integration gives us the general solution: 


y= Ž log (CE). 


1 


162 LINEAR DIFFERENTIAL EQUATIONS [53 


The sign of E must be the same as the sign of O, whilst obviously 
y—> 0 as — 0; in addition, 
i 1 
y' = = [1 + log(0$)] — <œ, 
i.e. the integral curves in (&, y) coordinates meet at the origin, where 
they touch the OY axis (Fig. 32), so that the origin is a node. 
A basic assumption in the transformation of equation (35) to the 
form (38) was that the straight lines (37) are not parallel. If these 


Y 


Frc. 32 


are parallel, their left-hand sides do not vanish identically, and the 
differential equation of the stream lines: 


dg = dy 
Ay tay +d, ang tagy +b 
has no singular points; now, one and only one stream line passes 
through every point of the plane. 
In a more general case than that of collinear motion, the equation 
with a singular point at the origin has the form: 


dg dy 


= FF eee Sees (88 
Ay © + Ay y + 0, v2 4+ 6, ry +... az L + Ago y + b, 2% + ewy +... (53) 


The denominators here contain terms of higher order than the 
first in x and y. Integration of such equations does not lead to a 
quadrature except in special cases, though the nature of the singular 
point can sometimes be found from the initial coefficients in the 
denominators of (53) without in fact carrying out any integrations. 
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The argument runs roughly as follows: for x and y of small absolute 
value, i.e. in the neighbourhood of the origin, the higher order terms 
in the denominators of (53) will be small compared with the first 
order terms and the form of the integral curves near the origin may 
be supposed to be the same as it would be if we only considered the 
first order terms in the denominators. If this is so, we encounter the 
same sorts of singular point with equations (53) as were obtained 
above for collinear motion. These suppositions are correct, with 
certain exceptions. We state the final result, without proof: 

(1) If the roots of quadratic equation (42) are real and distinct 
and have the same sign, the singular point x = y = 0 of equation 
(53) is a node. This means that every integral curve that approaches 
sufficiently close to the singular point strikes this point. 

(2) If the roots of (42) are rea: and have opposite signs, the singular 
point is a saddle point, i.e. two integral curves pass through it. 

(3) If the roots of (42) are complex, with non-zero real parts, the 
singular point is a focus. 

(4) If the roots of (42) are pure imaginary, the singular point is 
a focus or centre. 


CHAPTER III 


MULTIPLE AND LINE INTEGRALS. 
IMPROPER INTEGRALS. 
INTEGRALS THAT DEPEND ON A 
PARAMETER 


§ 6. Multiple integrals 


54. Volumes. We have so far considered the definite integral 
b 
f f(a) da, 
a 


as the limit of a sum, when the function f(x) is defined over the segment 
(a, b) of axis OX. In other words, the domain of integration has 
always been a straight line. 

We generalize the concept of integral in the present article to 
include the cases when integration is over a plane domain, or a domain 
in space, or finally, over a surface. We shall make use of intuitive 
ideas of area and volume for the present, and shall not dwell on 
certain points arising in connection with passage to the limit. The 
reader can find the main points of a rigorous treatment in a later 
section of this chapter. We start with double integrals, which have 
the same sort of connection with the calculation of volume as the 
integral written above has with the calculation of area. We shall 
consider the calculation of volume as a preliminary to our treatment 
of double integrals. 

We know that the area bounded by the curve y = f(x), the x axis, 
and the ordinates z = a, x = b, can be evaluated with the aid of a 
definite integral, and is in fact expressed by the integral written 
above [I, 87]. 

We turn to the analogous problem regarding the volume v of a 
body bounded by a given surface (8), whose equation is 


z = f(x,y), (1) 
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by the XOY plane, and by the cylinder (C) with generators parallel to 
the z axis which project (S) into the area (o) in the XOY plane (Fig. 33). 

We reduced the calculation of the 
volume of a body to a definite integral 
in [I, 104], where all we required to 
know were parallel sections through 
the body. We use this method in the 
present problem. 

We suppose for simplicity that the 
surface (S) lies wholly above the XOY 
plane and that the contour (l) bounding 
(o) is intersected at two points only 
by lines drawn parallel to the coor- 
dinate axes. 

We dissect the body by means of 
planes parallel to the YOZ plane, which 
cut the XOY plane in straight lines 
parallel to the y axis (Figs. 33 and 34). Let the abscissae of the 
extreme cutting planes be a,b. These are also the abscissae of the 
points that divide the contour into two parts (1) and (2), these 


parts being the loci of the points of entry and exit respectively of 
lines parallel to the y axis into the domain (ø) (Fig. 34). Each part 
has its own equation: 


Y = pı (£); Yo = P (1). (2) 
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The area of the section of the body cut by the plane PQ, distant x 
from YOZ, depends on x. Let this area be S(x); we have [I, 104]: 


v= ( S(x) dx. (3) 


It remains to find an expression for function S(x), which is the 
area of the figure M, N, N, M,, lying in the plane PQ, and bounded 
by the curve of intersection N,N, of plane PQ and the surface 
(S), by M, M,, parallel to OY, 
and by the two ordinates M,N, 
and M, N,. 

Since œ% is constant for all 
points of the section concerned, 
the ordinate of the curve N,N, 
can be taken as a function of y, 
given by the equation: 


z= Í (x, y) 
with constant 2; with this, the 
independent variable y varies in 
the interval (y,, Y2), where y, and 
Yz are the ordinates of the points of entry and departure respec- 
tively of the straight line M, M, into or from the domain (ø). 
We can write by [I, 87]: 
Yı 
S(x) = f f(x,y) dy; 


yı 


Fie. 35 


and on substituting in (3), we have: 


b Yı 
v= f dx f f(x,y) dy. (4) 
a y 
Here we have expressed the volume as an iterated integral, where 
integration is first carried out with respect to y with constant v, 
then the result obtained integrated with respect to v. 
If we dissect the given body with planes parallel to XOZ, we can 
express the same volume as 


v= f dy f f(x, y) dz, (5) 


Xı 


where x, and x, are known functions of y: 


£1 = Yı (Y); £z = Ya (Y), (6) 
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and a and f denote the extreme values of y on the conteur (l) (Figs. 
33 and 34). 

Two assumptions were made in deducing expressions (4) and (5): 
(1) that the surface (8) lies wholly above the XOY plane, and (2) 
that the contour (l) of the projection (c) of (8) on the XOY plane is 
intersected at two points only by any line parallel to a coordinate 
axis. If condition (1) is not satisfied, the right-hand sides of (4) and (5), 
instead of giving the volumes, give the algebraic sum of the volumes, 
volumes being considered (+) or (—) depending on whether they lie 
above or below the XOY plane. If condition (2) is not satisfied, 
as for instance in Fig. 35, where contour (l) is intersected at more 
than one pair of points by a line x = const., 
the domain (ø) must be divided into sub-domains, 2 
in each of which condition (2) is satisfied. Y 
The surface (9) and volume v are now corres- AN 
pondingly subdivided, and each partial vol- WAN 
ume can be calculated by means of (4). 7 


Examples. 1. The volume of a truncated rectang- 
ular prism (Fig. 36). The base is formed by the x 
and y axes and by the straight lines x = k, y =l. 
The cutting plane has the equation: 


X 
x y z 2 
= +-—-+—=1. 
à BOY Fig. 36 
Expression (4) becomes in this case: 
k i k ot k i 
7 2 EE E fe yt ewer ak Sa 
v= [as [edy= [def »(1-4-—-P)ay=» [ae (y- 3 - so) 
o 0 6 0 0 


k 
al 2 k?l kl k l 
=» |(-7- gte- ag) gr A 
0 


where øg is the base area and h is the ordinate of the point of intersection of 
the diagonals of the upper section (corresponding to x = k/2, y = 1/2). 
2. The volume of the ellipsoid 


a? y? z? 
ao to task 


If the ellipsoid is cut by planes z = const., ellipses are obtained with semi- 


axes 
2? z? 
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and with area 
z2 


so that the required volume is 


55. Double integrals. We obtained an approximate expression for the 
area of the curve y = f(x) [I, 87] by dividing it into vertical strips 
and taking the area of each as equal to that of a rectangle with the 
same base and with height equal to some mean value of the ordinate 
of the curve in the strip. As the number of strips increased and the 
width of each tended to zero, the error 
tended to zero, and the approximate 
formula became the definite integral 
in the limit, accurately expressing the 
area, 

An analogous method can be used for 
calculating volumes. We divide the 
domain (ø) (Fig. 37) into a large num- 
ber of small elements Jo of arbitrary 
shape. Here, Ao refers either to the 
element itself or to its area. We take 
each element as the base of a cylinder 
which cuts out an elementary volume from the volume v on being 
continued to its intersection with surface (8). The elementary volume 
can evidently be taken as approximately equal to the volume of the 
cylinder of base Ac and of height equal to the ordinate, ie. the 
value of z, of any point of the element of surface whose projection 
is 4o. In other words, if we take any point N of the element Ao and 
let the ordinate of the corresponding point M of the surface (S8), 
i.e. the value of f(z, y) at the point N, be denoted for brevity by f(N), 
we have f(N)4o for the elementary volume, and 


Fre. 37 


v~ af (N) Ao, 
0) 


where summation extends over all the elements do that make up 
the area (a). 
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The smaller the elements Ao and the larger their number, the 
closer the approximation obtained, and we can write in the limit: 


lim > f(N) do =v. 
(9) 


Turning aside from geometric concepts, we can define the above 
limit of a sum independently of the geometric form of the function 
{(N); the limit is called the double integral of function f(N) over the 
domain (c), and is written as: 


J { Edo = lim Z fW) 40. 


The existence of the limit written is self-evident, since, as we 
have noted, it necessarily gives the volume v described above. This 
is not a rigorous argument, of course, but a rigorous analytic proof 
of the existence of the limit is possible for fairly general conditions 
regarding f(N) and for continuous functions without exception. 

On setting f(N) = 1, we can express the area o of the domain (ø) 
as a double integral: 

o= f f do. 


(9) 


We give the detailed definition of a double integral: let (øo) be a 
bounded plane domain, and let f() be a function of points of this 
domain, i.e. a function that takes a definite value at every point 
N of (a). We divide (o) into n sub-domains with areas Ao,, AO ..., 
Aon, and let N,, Nz, ..., Nn be arbitrary points of the sub-domains. 
We form the sum of the products: 


SH (N+ bop 
k=1 


On indefinite increase in the number n of sub-domains and indefinite 
decrease of the area Ao, of each, the limit of the sum is called the double 
integral of function f(N) over domain (o): 


is f(N)do = lim X Í (Np) Aog. 


Remark. Let d; be the maximum distance between two points of 
the sub-domain of area do, (the diameter of the domain), and let d 
be the greatest of the numbers d,, d», ...,d,. The indefinite decrease 
of each of the Ao, referred to in the definition means that d —> 0. 
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If the magnitude of the integral is denoted by I, the above definition 
is equivalent to the following: given any positive e, there exists a 
positive 7 such that (cf. I, 87] 


I— X fN) 40, | <e, 
k=l 


provided d < y. A full account of the theory of multiple integrals 
appears at the end of the chapter and includes a rigorous definition 
of area, a more precise description of the domain (ø) over which 
integration can be carried out, as 
well as an explanation of how dis- 
section is possible into sub-domains 
and a proof of the existence of the 
limit of the sum mentioned above 
in the case of continuous functions 
{(N) and a class of discontinuous 
functions. 


56. Evaluation of double integrals. 
By considering a double integral as 
a volume, we can deduce a method 
for reducing a double integral to an 
iterated integral. 

If the domain (ø) is referred to 
rectangular coordinates, we can let 
the elementary domains Jo consist 
of rectangles of sides 4x, Ay formed 
by lines parallel to the coordinate axes (Fig. 38). Let (x, y) be the 
coordinates of the point M. We can now write: 


Fra. 38 


f(N) =f(z,y); Ao=AxdAy; do=dady 
and 


i f(N)do = lim = f(z, y) 4x Ay = SJ He, y) dx dy. 


On the other hand, on applying what was said in [54] about evalua- 
ting a volume in terms of an iterated integral, we can write: 


: B Xa 
f f Fæ n dedy = f def fæ ydy = fay f fedes (1) 


(2) Yı 
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which gives us the rule for evaluating a double integral, independently 
of the geometrical significance of function f(z, y). 

If the first integration is with respect to y, x is meantime reckoned 
constant, whilst the limits y, and y, are the functions of æ given by 
(2) [54]. And similarly, if the first integration is with respect to x. 
The limits of the first integration in the iterated integral are definite 
constants, independent of the variable of the second integration, 
only in the case when the domain of integration is a rectangle with sides 
parallel to the axes. If (c) is a 
rectangle (Fig. 39), bounded by 
the lines 


x=a; =b; y=a; y=, 


we have 


SS Fæ, y) dedy = 
(0) 


b 8 
= f de f f(e,y)dy = 


<= (ay i f(z, y) dx. (8) Fria. 39 


The quantity do = dg dy is called an elementary area in rectangular 
coordinates. 

We remark that the first integration of (7) with respect to y with 
constant x corresponds to summation over the rectangles consisting of 
strips parallel to axis OY, all these rectangles having the same 
width dz, which is taken outside the sign of the first integration. 
The second integration, with respect to x, corresponds to adding all 
these sums. We give an accurate derivation of (8) and (7) in the next 
article. 

If lines parallel to the axes intersect the boundary of (o) more than 
twice, we must proceed as mentioned in [54]. 

We naturally assume, here and later on, that the integrals spoken 
of exist [95]. A sufficient condition for thisisthat the integrands 
be continuous in (øo) as far as its boundary, which we shall assume 
to be the case, and that the domain (ø) satisfy the condition laid down 
in the basic discussion of integrals in [91]. 
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Now let (ø) be referred to polar coordinates. The equation of the 
surface (S) will now have to be written in the form z = f(r, y), where 
(r, p) are the polar coordinates. 

We now get the elements do by drawing the families of lines 
r = const. and p = const., i.e. concentric circles and radius vectors 
passing through the origin (Fig. 40). If we take the particular Ao 
formed by the circles of radii r and (r + Ar) and by the radii at 
angles p and (p + Ap), the curvilinear figure can be replaced to an 
accuracy of higher order infinitesimals by the rectangle of sides Ar 
and r Ag, so that 


Ao =r Ar Ag, 


and we can write: 


SSF) do = lim © f(r, 9) 1dr Ap = Sf f(r, v) rdr dg. 
a) (9) (G 


The function under the double integral obtained here is rf(r, ¢). 
The same rule as above can be used for reducing this to an iterated 
integral, except that the roles of x and y are played here by r and g. 

The first integration with respect to r at constant » corresponds 
to summation over the elements 4a contained between the two radii 
pand (p + dq), dp being taken outside the sign of the first integra- 
tion. The second integration with respect to p corresponds to addition 
of all these sums. When applying the rule, we first note the extreme 
values a and # of the argument p (the extreme values of x in [54]), 
then the radius vectors r, and 7, of, respectively, the points of entry 
into and departure from (ø) of the vector p = const. (which corres- 
ponds to finding y, and y, in [54]). Given these values, we have: 


[f iMd = ff fr, g)rdrdp = f dp f fr grdr, (9) 
(o (0) a T, 


where 7, and 7, are known functions of 9. 

Figure 40 illustrates the case when the origin lies outside the 
contour (l). If the origin is inside the contour, y can be taken as 
varying from 0 to 22 and r from 0 to r, for a given g, where 7, is obtained 
from the equation of the curve (1): r, = p(y), so that we have (Fig. 41): 


27 f, 
ff {(N)do = f do fiC, g)rdr. 


ta) 
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Fie. 40 


The expression 
rdrdy (10) 


is called an elementary area in polar coordinates. 
In particular, if (7) = 1, we get the expression obtained in [I, 102] 
for the area under a curve in polar coordinates: 


8 r, B 
f dy f rdr =~- f (rz — r?) do. 


a 


(The expression in [I, 102] is for the case when r, = r and 7, = 0.) 


Example. We find the volume included between a sphere of radius a and 
a right circular cylinder of radius a/2, passing through the centre of the sphere 
(Fig. 42). We take the centre of the sphere as origin, the XOY plane perpendi- 
cular to the axis of the cylinder, and the axis OX from the centre of the sphere 
to the point of intersection of the axis of the cylinder and the XOY plane. 
We can say by symmetry that the 
required volume is four times the Z 
part of the cylinder bounded by the 
ZOX, XOY planes and by the upper 
hemisphere. 

The domain of integration here is 
half the base of the cylinder, the con- 
tour of which consists of the semi- 
circle 


r = a COS p 


with the corresponding part of axis 
OX; the angle ọ varies from 0 to 
z/2, and the corresponding radius 
vector from the axis OX to axis OY. 


174 MULTIPLE AND LINE INTEGRALS [57 


The equation of the spherical surface,: 


x? + y? + 2% = a? 
can be written here as 


z= at— (a? + y?) z= Yae—r?. 


The required volume is therefore 


n bid 
2 acosy 7 3 
ee aS “4 |r=acos 
v=4 [dy f Ya? —r?r dr=4 fe- m] D t dp = 

ô 0 0 = 

n 

2 p7 

` 3 = 

=+| (a? — a? sin? p) dp = an [o+ cosp — 2E ] oi 

T 3 el fa ee 


57. Curvilinear coordinates. We defined an elementary area in the 
previous article, and considered the problem of evaluating integrals 
in the case of rectangular (x, y) and polar (r, y) coordinates. We now 
deal with these problems with any coordinates (u, v), where the new 
variables u, v that replace x, y are given by 


g(x,y) =u; p(z, y) =v. (11) 


If we fix the value of u and take v variable, we get a family of 
lines on a plane. Similarly, if we fix v and take u variable, we get 
a second family of lines. The lines of the two families can be either 
curved or straight (Fig. 43). 

The position of a point M on the plane is defined by a pair of 
numbers (x, y) or, by (11), by the pair of numbers (u, v), which are 
referred to as the curvilinear 
coordinates of the point M.We 
get expressions for the rectan- 
gular (x, y)in terms of the cur- 
vilinear (u, v) coordinates on 
solving equations (11) with 
respect to x and y: 


x= gp (u,v); Y= Py (u,v). (12) 


In the case of polar coor- 
dinates, u becomes r and v 
Fia. 43 becomes gy. The lines of the 
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constant u and of the constant v, mentioned above, are called coor- 
dinate lines in curvilinear coordinates (u,v); they form two fami- 
lies of lines (circles and radii in polar coordinates). 
We now find the elementary area do in curvilinear coordinates. 
For this, we consider the area M, M, M, M, (Fig. 43), formed by 
two pairs of infinitesimally distant coordinate lines: 


g(x,y) =u; p(x, y) = u+ du, 
yz, y) =v; ylz, y) =v +d. 


The coordinates of the vertices of M, M, M, M, are given to an 
accuracy of higher order infinitesimals [I, 68] by: 


(M) x, = g, (u, v); Yı = y (u, v). 
(Mg) ta = p, (u + du, v) = q, (u, v) + PEE) au; 


y=p (u+ du 0) = yp (u v) + EE au. 


(M3) x3 = g, (u + du, v+ dv) = q; (u, v) p Se (9) du + eP dy ; 
Ya = yı (u + du, v + do) = yy (u, v) + PE) du + PH?) dy 
(M,) 24 = 9, (u, v + do) = 9, (u, 0) + PH") do; 

Ya = y (m v + do) = y; (u,v) + PE dy, 


It follows at once from these expressions that 7, — 2, = %, — x, and 
Yz — Yı = Yz — Yay and hence the segments M, M, and M, M, are 
equal and have the same direction. The same can be said of M, M, 
and M, M,, ie. M, M, M, M, is a parallelogram to an accuracy of 
higher order infinitesimals; its area is thus twice the area of triangle 
M, M, M}, and therefore, by a familiar expression of analytic geo- 
metry: 
do = |X (Y2 — Ys) — Yı (La — Xs) + (LY3 — L3Ya) |. 


On substituting the expressions for the coordinates, we get the formula 
for an elementary area in any curvilinear coordinates: 


do = Pan an _ Prdy) Bos (eh 9) ay do = |D| du dv, 
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where D is called the functional determinant (or Jacobian) of the 
functions p (u, v) and y(u, v) with respect to u and v: 


D= Op, (u,v) Oy, (u, v) Op, (u,v) dp, (u, v) 
Ou w ov ðu s 


Finally, the formula for change of variables in a double integral is: 
S f æ, y) do = f | F(u, v) |D] du dv, (13) 
(0) to) 


where F(u, v) denotes the function of u and v to which f(z, y) is 
transformed as a result of (12). The limits of integration with respect 
to u and v are found from the form of (c) in a manner similar to that 
described in [56] for polar coordinates. 

We took u and v in transformations (11) as the new curvilinear 
coordinates of a point, and reckoned the plane invariable. Conversely, 
we can take u and v as the previous rectangular coordinates and let 
(11) transform the plane so that points with rectangular coordinates 
(x, y) become points with rectangular coordinates (u, v). Such a trans- 
formation changes (o) to a new domain (2), and (13) now has to be 
written as: 


{$ f Kæ, y) do = f f F(u, v) |D| du do, 
(0) (2) 


where the u and v arè the rectangular coordinates of points of the 
domain (Z) and the limits of the integration over (2) are found in 
the manner of [56]. On putting f(z, y) = F(u, v) = 1, we get an 
expression for the area o of the domain (c) as an integral over (X): 


o = Í f |D| du dv. 
iz) 


One consequence of this is that, from our new point of view, | D | 
at any point N of the domain (2) is the coefficient for the change of 
area at N on transition from (X) to (ø), i.e. it is the limit of the ratio 
of the area of a domain in (ø) containing the image of N to the area 
of the corresponding domain in (Z) which contains N, when the latter 
domain contracts to the point N. We deal with the change of variables 
in a double integral in more detail from this point of view in [77]. 


Examples. 1. We take the circle x? +- y? < 1 of unit radius and with centre 
at the origin in the XY plane. We introduce new variables in accordance with 
the formulae for passage to polar coordinates: «=rcosg, y =r sing but 
in fact take r and 9 as rectangular coordinates; thus a point with rectangular 
coordinates (x, y) becomes a point with rectangular coordinates (r, ). Clearly, 
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with this, our circle becomes a rectangle whose sides are the straight lines 
x = 0, x = 1, y=0, y = 2x (or r=0, r=1, p=0, p = 2x); the origin 
æ = y = 0 corresponds to the complete side r = 0 of the rectangle, whilst the 
opposite sides p = 0 and » = 2x both correspond to the same radius of the 
circle. If we use the rule expressed by (8) for reducing a double to an iterated 
integral, we can see at once that the limits for integration in polar coerdinates 
over our circle must be r = 0 and r = 1 for r, and » = 0 and g = 2n for g. 
The rules given in [56] for finding the limits of integration in polar coordinates 
can be similarly explained. 
In the present case 


__ G(r cos p) . a(r sin p) O(r cos p) . O(r sin p) - 


2 ðr Oy oy or , 


and, as we saw above, do = rdrdg. 

2. We take a right-angled triangle (0), formed by the coordinate axes and 
by the straight line 2 + y = a, as a further example of the second aspect of 
double integrals. The coordinates of points inside (o) must satisfy the inequali- 
ties: 

x>0; y>0; w~+y<a. (14) 


We introduce new variables (u, v) by putting 


x+ y=u, ay =u, 
i.e. 


or 


We take (u, v) as a new system of rectilinear rectangular coordinates. It follows 
from the last expression that inequalities (14) are equivalent in the new system 
to:0 <u <a,0 <v <a, which define a square (X) with a vertex at the origin 
and sides along the axes. Every point (x,y) of (o) corresponds to a definite 
point (u, v) of (X), and conversely. We get for D: 
a-—v u 


u v 
a a a 


D= 


u 
a’ 


and (13) takes the form: 
f fre y) dzdy = Í [Fe v) dude, 
(2) @) 
or, on introducing the limits of integration in accordance with (7) and (8) 


a a—x a a 


fac f f(y) dy = — f udu f F(u, v) dv. 
0 0 


0 0 


178 MULTIPLE AND LINE INTEGRALS [58 


58. Triple integrals. Instead of interpreting the double integral of 
[55] as the volume of a body, we can take it as representing the 
mass of a material distributed over the plane domain (ø). Suppose 
that Am is the amount of material on the elementary area 4o which 
contains a given point N. If the ratio dm/do tends to a definite 
limit as do contracts to N, this limit f(N}) gives the density of the 
surface distribution of material at the point N: 


ee | 

lim +" = fN). 
On dividing (ø) into small elements 4c, the mass on a single element 
will be approximately /(V) 4c, and we can write the approximation 
for the total mass on (ø): 

m~ fN) do, 

(2) 

where the summation extends over all the do that make up (ø). 
This approximation increases in accuracy as the do diminish. We have 
in the limit, on indefinite overall contraction of every element Jo, 
and with their number indefinitely increasing: 


m = lim Z A(N) 40 = pee 


In the same way, consideration of the mass of a spatial distribution 
of matter leads to the concept of a triple integral. We take a volume 
(v) bounded by a closed surface (S) and let matter of total mass m 
be distributed throughout the volume. We divide (v) into a large 
number n of small elements Av, and let the mass of Av be Am; 
assuming that the ratio 

Am 
Av 
tends to a limit on indefinite contraction of a Av to the point M 
contained in it, the limit is called the density of the (spatial ) distribution 
at the point M. 
We denote the limit by }( M): 


Am 
Aa M). 


lim 
We can write approximately, as above: 


m~ > f(D) Av, 
M) 
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where the summation extends over all the elements Av that make 


up (v). 
We have in the limit, on indefinite contraction of every element dv: 


m = lim © f(M) Av. 
(v) 
The above physical example leads us to a general definition of triple 
integrals, analogous to that for double integrals. Let (v) be a bounded 
domain of three dimensional space and let f{(Jf) be a function of 
points of the domain, i.e. a function that takes a definite value at 
every point M of the domain. We divide (v) into n parts, of volumes 
Av, Mv, ..., Av, and containing respectively the points M,, 
My, ..., Mn. 
We form the sum: 


SIM) Ar. (15) 
k=l 


The limit of this sum, on indefinite decrease of every sub-domain and 
indefinite increase in their number, is called the triple integral of the 
function f(M) over the volume (v): 


ff S IAN do =lim Sf, Avy 
v) k=1 


Remark [cf. 55]. Let dp be the maximum distance between two 
points of the sub-domain Av, (the diameter of the sub-domain) and 
let d be the greatest of the numbers d,, d,, ..., dn. Indefinite decrease 
of every sub-domain means that d— 0. If I denotes the magnitude 
of the integral, the definition given above is equivalent to the follow- 
ing: for any given positive number e there exists a positive ņ such 
that 


n 
I— Sf(M,) Av, <e, provided d<ny. 
k=l 


The rigorous theory of triple integrals will be found with that of 
double integrals at the end of the chapter. 


If f(M) = 1 throughout the domain (v), we get the volume v of 
the domain as: 
v=fff de. 
w) 


The evaluation of triple integrals requires their reduction to single 
or double integrals, which can then be evaluated by known methods. 
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We take a system of rectangular coordinates in space and suppose, 
for simplicity, that any line parallel to a coordinate axis cuts the 
surface (S) bounding the volume (v) in not more than two points. 
We draw a cylinder which projects the surface (S) on to the XOY 
plane to form the domain (o,,) (Fig. 44). The tangent curve of the 
cylinder and surface cuts (8) into two parts: 


(I) 2 = p (x,y); 
(II) Ža = Pa (z, y). 


Fie. 44 


A line parallel to axis OZ and passing through any point of (oxy) 
enters the volume (v) through section (I) and leaves it via section (II); 
the ordinates of the points of entry and exit, z, and z, will be known 
functions of (x, y). 

We now agree to divide volume (v) into elements Av as follows: 
we divide the area (oyy) into a large number of small elements 4o; 
taking an element as a base, we draw a cylinder for every element; 
the intersection of the cylinder with volume (v) is then divided into 
elementary cylinders of height 4z by means of planes parallel to 
XOY and at a distance 4z apart. The elements of volume obtained 
will be given by 

Av = Ao Az. 
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We take the element do containing the point N(x, y) and produce 
a line through it parallel to axis OZ to cut (S) in points with ordinates 
zı and 2,; then we take points M(x, y, 2) on this line, lying inside the 
corresponding elements Av. 

The sum of (15) can be written as: 


grena y,2) 4v = S Ao S f(x,y, 2 
v) 


(2) (2) 
Having fixed do for the present, we let the dz diminish, and get 
by the concept of definite integral: 


lim Sfle,y,2)4z= f (f(æ, y, 2) dz, 
(z) EA 
where the z and y must be looked on as constant parameters. Hence 
we have approximately: 
23 
> {x,y,z} de~ È f(x,y,z) dz = O(a, y). 
(2) 2 
But now it is clear, by the definition of double integral, that 


fx, y,2) 4o~ S Ao G(x, y) > f f D (x,y) do, 
(v) (Oxy) (Czy) 
i.e. 


VLG y, z) dv == Fal do fte, y, 2) dz. (16) 
(0 Oxy) EA 


On setting aside the geometrical interpretation, the above arguments 
lead us to the following rule for evaluating triple integrals. 
To reduce the triple integral 


Sis f (x, y, 2) dv 


to a single and double integral: (1) project the surface (8) pene! the 
volume (v) on the XY plane to form the domain (oyy); (2) find the 
coordinates z, and z, of the points of entry and exit of a io Bey to 
OZ through the point (x, y) of (oxy); (3) evaluate the integral 


ff (x, y, 2) dz, 
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taking x, y constant, then evaluate the double integral 


Zz 
Sf do { f (x, yY, 2) dz. 
(Czy) Zi 
The double integral may be in turn reduced to an interated integral, 
on using rectangular coordinates (x, y), and we finally get: 


b s z 
REIG y, z)dv = f de f” dy (f(x,y, z) dz, (17) 
& a Yı zı 


where the limits (y,, Y2) and (a, b) are found as in [54]. 

We suggest that the reader work out the alternative methods of 
reducing a triple to an iterated integral, by projecting (8) on to the 
YZ plane to form an area (oy,), or on to the XZ plane to form the 
area (0xz). The reader might also consider the more difficult cases, 
when lines parallel to the coordinate axes cut the surface in more 
than two points. 

Equation (17) may be written as: 


b t Zs 
Sis f(x,y, 2) dx dy dz = f dx i dy § f(x,y, 2) dz. 
v a Yı zı 


The product dæ dy dz is called an elementary volume in rectangular 
coordinates, and is obtained by dividing the volume (v) into infinitely small 
rectangular parallelepipeds by means of planes parallel to the coordinate 
axes. 

Formula (17) is derived rigorously at the end of the chapter. 
We remark that if lines parallel to the axes cut (8) in more than 
two points, (v) has to be sub-divided so that each part of it is not 
intersected at more than two points. Integration is carried out as 
above for each sub-division and the values added to give the integral 
over the whole volume (v). 

If (v) is a rectangular parallelepiped, bounded by the planes 


=a; «= bd; y=; y=b; 2=—a3 2= dg, 


parallel to the axes, the limits for all three integrals become con- 
stants: 


b b h 
PG y, 2) da dy dz = {dx f dy (F(z, y, 2) dz. (18) 


59. Cylindrical and spherical coordinates. A system of rectangular 
coordinates in space is often not the most convenient; of the alter- 
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native systems, the most important are those of cylindrical and 
spherical coordinates. The position of a point in the usual rectangular 
system is determined by three coordinates (a, b, c), the point being 
situated at the intersection of the three planes x = a, y = b, z= 0, 
parallel to the coordinate planes. We can think of the space here as 
filled by three families of mutually perpendicular 
planes 
e=C, y=Cy z= C3, 
where C,, C,, C,are constants, every point of space 
being the intersection of three planes of the 
families. Let the coordinates 2 and y now be 
replaced by r and ¢, whilst the coordinate z is 
retained, where 
æ =rcosg; y=rsng; z=zZ. 

The coordinate 7 is the distance of a point M 
from axis OZ and g is the angle between the 
plane passing through OZ and M and the plane 
XOZ (Fig. 45); p can vary from 0 to 22, and r from 
0 to (+æ). The coordinates (r, p, z) are called the cylindrical coor- 
dinates of the point M. Points on axis OZ correspond to r = 0, 
their coordinate p being indeterminate. 

We have the following three coordinates surfaces in this case 


r=C,; p=C,; z= 0}. 


The family r = C, is a family of circular cylinders with OZ as 
their common axis of revolution. The second family pọ = O, is a 
family of half-planes passing through axis OZ, and lastly, z = C, is 
a family of planes parallel to the XOY plane. 

On giving the variables r, p, z increments Ar, Ap, Az, we get two 
neighbouring surfaces for each family, which together form an 
elementary volume in cylindrical coordinates. Only one coordinate 
varies along each edge of the element, whilst every pair of edges is 
orthogonal (Fig. 46). To an accuracy of higher order infinitesimals, 
the element can be taken as a rectangular parallelepiped with edges 


Ar, rAg, Az, 


which leads us to the following expression for an elementary volume 
in cylindrical coordinates: 


dv = r dr dọ dz 
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as well as to the expression for a triple integral in these coordinates: 
Sf fF (AL) do = f ff f(r, 9,2) rdrdpdz, (19) 
() (>) 


the limits of integration being found by similar methods to those 
used for rectangular coordinates. 


Fie. 46 


Example. To find the mass of the segment of a sphere, filled with non- 
homogenous material whose density is proportional to the distance from the 
base of the segment (Fig. 47). 

We locate the origin at the centre of the sphere, and take for the XOY plane 
the diametral plane parallel to the base of the segment; we direct axis OZ 
from the origin towards the segment. Let the radius of the sphere be a, the 
height of the segment h, and the radius of base of the segment ry. 

The equation of the sphere becomes in cylindrical coordinates: 


r2+22ea? or 2%=a?— r. 
From what was given, we can write the density as 
f (r, p, z) =b + ez, 


where b and c are constants. 
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Application of (19) gives: 


Qn To Varr! 
m= |] f o+ erarapaz= f dp f rar f (b + cz) dz = 
i) 6 ò a—h 
To 
Z E arer 
=27 f [è + y? hae rdr. 


0 


We leave it to the reader to carry out the substitutions for z and the integra- 
tion; the result is 


rt 
m= bw + on —?., 


where v is the volume of the segment. 


Fic. 47 Fic. 48 


We now discuss spherical coordinates, or polar coordinates in space, 
as they are sometimes called. Let M be a given point in space, and 
OM the vector drawn from the origin O to M. The position of M 
can be defined by the following three quantities: the length ọ of OM; 
the angle p that the half-plane passing through axis OZ and M 
makes with the XZ plane; the angle 6 that OM makes with the positive 
direction of OZ (Fig. 48). With this, @ can vary from 0 to (+2); 
p is reckoned counter-clockwise from axis OX and can vary from 0 
to 2x; finally, 0 is reckoned from the positive direction of OZ and 
can vary from 0 to x. Definite coordinates 0, y, 4 correspond to 
every point M, and conversely. We drop a perpendicular from M 
to the XY plane to meet it in N, then drop another perpendicular 
NK from N to axis OX. The rectangular coordinates 2, y,z of M 
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are clearly given by OK, KN, NM. We have from the right-angled 
triangle ONM: 
ON = osin@ 


and on also using the right-angled triangle ONK, we finally get 
the formulae for passing from rectangular to spherical coordinates: 


x = ọsinĝ cosy, y=eosinOsing, z= ọcosð. 


Fic. 49 


We consider the families of coordinate surfaces: 
C=C, P= B= Cg, 


The first is evidently a family of spheres with centres at the origin; 
the second is a family of half-planes passing through axis OZ, whilst 
the third is a family of circular cones with axis of revolution on OZ. 
We remark that the origin O corresponds to ọ = 0, the values of p 
and 0 being indeterminate. The coordinate ¢ is indeterminate for all 
points lying on OZ, where 0 = 0 or x. 

We get an elementary volume in spherical coordinates by giving 
variables 9, p and @ infinitesimal increments 4ọ, Ap and 490. 
Only one coordinate varies along each edge of the element, and 
pairs of adjacent edges are orthogonal (Fig. 49). The element can be 
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looked on, to an accuracy of higher order infinitesimals, as a rectangular 


parallelepiped of sides 
doe, o dé, esinddg, 


so that the elementary volume is given by 
dv = o? sin 0 de d0 dọ , 
whence we can write a triple integral in spherical coordinates as: 


ARGI f (M) dv = Sfs f (e, 9, p) e? sin 6 dg dé de. (20) 
C) @) 


Reduction of the triple integral to an iterated integral can be carried 
out as follows, for instance: we take'the central projection (ø) of the 
volume (v) from the origin on to the sphere of unit radius (Fig. 50) 
[if the origin lies inside (v), (ø) is the entire surface of the unit sphere]. 
We draw radius vectors passing through points of (o), which in the 
simplest case enter (v) at a point of radius vector pọ) and leave at a 
second point of radius vector e, [we put o, = 0 when the origin 
lies in (v)]. Now we have: 


SSS fe 0, phe? sin Odedddp— ff sin 0 dedp T Fle 0, p) e? do, 
&) a) Gi 


where p, and p, are known functions of 6 and ¢. The limits of integra- 
tion for @ and @ are determined by the shape of (0). 


Example. To find the mass of a sphere made up of concentric layers of dif- 
ferent densities. We can take the density here as depending on ọ only and equal 
to f(ọ), so that the mass is 


2n n a a 
m= f f f /(e)esin0dedodp= f dp f sind do f(e) eè de 4x È f(e) e? do. 
(2) 0 ò 0 ò 
If the density is constant and equal to unity, we get the expression for the volume 


of a sphere: 


4ra? 


Remark. An important geometric significance attaches to sin@ dé do, 
this being the elementary area into which the surface of the unit 
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sphere is divided by meridians and parallel circles (Fig. 51). If we 
divide the surface of the unit sphere into elements do of arbitrary 
shape, we have:t 


SS fab) dv = f do | f (ML) o? do, 


where (a) is the domain obtained on projecting centrally from the 
origin the volume concerned on to the surface of the sphere. 


Fra. 50 Fie. 51 


Let an elementary cone be drawn, with its vertex at the centre of 
the unit sphere and having the contour of the element do as directrix. 
The area do measures the solid angle subtended at the centre by the 
element of any surface (S) cut out from (S) by this elementary cone. 


60. Curvilinear coordinates in space. The position of a point in 
space is determined in the general case of curvilinear coordinates by 
three quantities g3, q} qz, related to the rectangular coordinates 
x, y, z by the formulae: 


P (2, Y, 2) = 913 Y(T, Y, 2) = Ga; w (2, Y, 2) = q3. (21) 


The three families of coordinate surfaces are obtained on putting q,, 
qə qz equal to constants. Three pairs of indefinitely adjacent coordinate 
surfaces form an elementary volume dv. We omit the proof and 


tdo denotes either the element itself or its area. 
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simply give the result, analogous to that obtained in [57] for two 
dimensions. The elementary volume dv can be considered as a 
parallelepiped, to an accuracy of higher order infinitesimals; on 
solving (21) for x, y, 2, we get: 
© = Pı (91 9a 9s) 5 Y= Pr (Irgo); 2 = 1 (M1 Gar Gs), (214) 
and dv will be given by: 
dv = | D | dq, dg2 dg,, 


so that the formula for change of variables in the triple integral is: 
SES He y, 2) de dy dz = JSS F (au du qo) |D | dg, dga dgs, 

where F(q,, da; q3) is derived from f(x, y, z) by transformations (21,) and 

D is the functional determinant of z, y,z with respect to qi, qz q3: 


D= Bes (SP, ; dw, W, ` sar) OP, (= Go, Oy 3) 


Og, \@q, Oge 99, dq) ' Og, (Oga Og, 99, dds 


+ oe (Qe. pes Feu 


Ogg (0g, Oge ge 


ðq, 9g, Og. qı 


As in [57], the transformations (21) can be looked on alternatively 
as a deformation of space, so that a point with rectangular coordi- 
nates (x, y, z) becomes a point with rectangular coordinates (gq), qz 
qa), D being interpreted now as the coefficient of the change of 
volume at a given point on passing from (q,, q, qs) to (x, y, 2). 

The reduction of a triple integral in coordinates q,, qa qa to three 
single integrals, and the determination of the corresponding limits, 
follow the same lines as for double integrals [57]. 


The reader familiar with determinants may notice that D can be written es 
the following third order determinant: 


09, OY, ðw, 


ðq? gp’? ðq 

D=| P Oy, du, 
dqa ’ aq,” 0g, 

i Oy, Oy 00, 

| 99,’ Ogs’ 245 


A detailed treatment of these determinants appears in Volume III. 
Example. Let (v) be a tetrahedron, bounded by x + y + z = a and by the 
coordinate planes, and defined by the inequalities: 


a>0; yoo; 2>0; rtyte<a. 
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We introduce new variables: 
s +y+Hz=g; aly +2)=9g,4; az= 49,9295 


and interpret (q1, dz, 93) as rectilinear rectangular coordinates. It follows from 
the above expressions that: 


= . bs aly +2) . _ œ 
q=r+y +z; 2 ety+tz2z’ 3 te 
or 
ea UOI). p UR), y NEG 
a a PMs a? ne 


It is easily seen, after the manner of [57], that the tetrahedron (v) becomes 
the cube (v,) 0 <q, <a; 0 <q: <a; 0 <q, <a. We find that here, D = 
= g3q,/a°, so that the transformation is expressed by 


Y 1 
fi fre y, 2) dx dy dz = [{ [ra Ye» Ys) a GH 92 Iq, dg dgy; 
(v) (v) 


or, on finding the limits of integration: 


a a-x a—x-y 


a a a 
1 
fae [ay f ieue de= <> f atda [asda | Fan 9295) das: 
ò o0 0 0 o ô 


61. Basic properties of multiple integrals. We demonstrated earlier 
the basic properties of definite integrals by making direct use of 
their definition as the limit of a sum [I, 94]. The same approach can 
be used for the properties of multiple integrals. We shall assume for 
simplicity that all the functions are continuous, so that the integrals 
have a meaning unconditionally. 

I. A constant factor can be taken outside the integral sign: 


f (af) do = a | J f(N) do. 
(0) (0) 


II. The integral of an algebraic sum is equal to the sum of the 
integrals of the terms: 


ff Ua) — guide = f FAN) da — f f p) do 
o) (2) (2 


III. If the domain (ø) is divided into a finite number of parts, say 
into two parts (o,) and (o,), the integral over the whole domain is 
equal to the sum of the integrals over the parts: 


{ SAM) do = f AN) do + f f AN) do. 
(9) 


(a) (1) 
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IV. If AN) < ọ(N) in (0), 
§ Sf) do < f f o() do. 
(0) (0) 


In particular: 


pinag fa) do. 
o. a) 


V. If (NV) preserves the same sign in (c), the mean value theorem 
is valid, i.e. 
f SAN) GW) do = fp) f f pW) do, 
(o) (0) 
where J, is a point lying inside (ø). 
In particular, we have with ọ(N) = 1: 


f SAN) do = fN) 6, 
(0) 


where o is the area of the domain (ø). 

Triple integrals have analogous properties. 

We remark that it has always been assumed, in the definition of 
double and triple integrals as limits of a sum, that the domain of in- 
tegration is finite, and that the integrand f(W) is bounded through- 
out the domain, i.e. there exists a number A, such that|/(N)|< A 
at every point N of the domain of integration. If these conditions are 
not fulfilled, the integral may exist as an improper integral, in a manner 
similar to that described for single definite integrals in [I, 97, 98]. We 
deal with improper multiple integrals in § 8 [86]. 


62. Surface areas. We take 


z= f(x,y) (22) 
as the equation of a given surface (S) and use the notation: 
3, 0 
LA = P; 3, =q. (23) 


We have seen [I, 160] that the direction-cosines of the normal (n) 
to (S) at the point (x, y, z) are proportional to p, q and (—1), i.e. 
using analytic geometry, they are given by: 


cos (n, X) = ; cos (n, Y) — 


ae ee g ; 
+V1 FP tg tVl+ P+ e ’ 

24) 
1 


METETA 


cos (n, Z) = 
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We find the area of the portion of (3) whichis cut out by the cylin- 
der (C), which projects the portion on to the XY plane as the domain 
(o) (Fig. 52). We divide (c) into small elements Ao; the cylinders with 
bases Ao divide (8) into ele- 
ments AS. 

(Emt) Let N(é, 4) be a point of Ac, 
the corresponding point of (8) 
being M(E, n, ¢), where €=f(é, 7). 
We draw the tangent plane and 
normal (n) to the surface at M, 
and let AS’ be the elementary 
area cut from the tangent plane 
by the cylinder of base Jo. 

We define the area of our por- 
tion of surface (S) as the limit 
of the sum of the elementary plane 
areas AS’, when the number of 

0 elements Ao increases indefinitely, 

Fra. 52 whilst each contracts indefinitely. 

We show that this limit is given 

by a double integral over (ø). The 

element Ao is the projection of the plane element AS’ on the XY 

plane, whilst the normals to these elements form the angle (n, Z), the 
cosine of which is given by the third of expressions (24); hencef 


1 


Ao = AS’ ————~ 
Vlit+p?+¢ 


AS’ =J1+p?+¢@Ao, 


} Let (S,) be the projection of the plane domain (S,) and let pọ be the acute 
angle between these two planes, i.e. the angle between their normals. It is easily 
seen that the relationship holds: S, = S, cos p, where S, and S, are the areas 
of (Sı) and (S,). 

We show this by a rectangular dissection of the areas by two families of 
straight lines, one family being parallel to the line of intersection of planes 
(S,) and (S,). Let the rectangles of (S,) be the projections of those of (S,). 
A length parallel to the intersection of (S,) and (S,) is unchanged on projection, 
whilst a length in the perpendicular direction is multiplied by cos ø; thus if 
dady is an elementary rectangle of (S,) the corresponding elementary rectangle 
of (S,) will be cos ¢ dady, and hence: 


8, = cos p dz dy = cos pff dg dy == 8, cos ¢. 
Si) (Sx) 
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so that the area S of our surface becomes by definition: 
S = lim > AS’ = lim >) V1 + p+ £ Ao. 
(0) 


The limit on the right-hand side is given by the double integral over 
(ø), so that 


S= [S VT+ do = f f VT+ p+ g dedy (25) 


which is the required formula for the area of the portion of a curved 
surface cut out by the cylinder with generators parallel to axis OZ. 

The integrand in (25) represents an elementary area dS of the sur- 
face. We can write, on using the expression for cos (n, Z): 


do,y 


dS = V1 + p? + q? doy = Teosa By oF dø, = |cos (n, Z)| dS. (26) 


Here, do, is the projection of dS on the XY plane. We have 
to take the absolute value of cos (n, Z), since the elementary areas 
doy and dS are reckoned positive. 

We assume that the p and q defined by (23) are continuous functions 
of (x, y). The above argument leads to the expression of the limit 
of the sum of areas 4S’ as an integral (25) of a continuous function, 
and proves, moreover, that this limit exists. A defect of the above 
definition of surface area is that it contains the operation of pro- 
jection, which is bound up with the choice of XY plane. It can be 
shown that the magnitude of the surface area is in fact independent 
of the choice of XY plane. We also remark that, if the lines parallel 
to OZ meet (S) in more than one point, evaluation of the surface 
area by (25) requires division of the surface into separate sections 
and evaluation individually of the area of each. 

A new definition can be given that is independent of the choice 
of axes. Let (S) be a piece of a smooth surface, bounded by a smooth 
contour. Let (S) be divided into portions (Sj), (83), ..., (Sn); we take 
an arbitrary point M, of each (Sx) and let p, be the area of the pro- 
jection of (S,) on to the tangent plane to (S) at Mp. It can be shown 
that, given the assumptions regarding the smoothness of (S) and 
its contour, the sum p, + Pa + .-. + Pn tends to a definite limit S 
if the greatest of the diameters ô of the portions tends to zero [cf. 55]. 
With the explicit equation (22) of the surface and with continuous 
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derivatives (23), this definition of surface area also leads to formula 
(25) for the area S. (G. M. Fikhtengol’ts, Kurs differentsialnogo i 
integralnogo ischisleniya, Vol. III, §§ 599—601). 


Examples. 1. To find the area of the portion of spherical surface considered 
in the example of (56). 


We have 
=Va? r? uè: St E a TR 
z Va T ys p yay z>? 
—y CoPI 
q Va? — a? — y? z 


EEA 2 2 2 2 2 a 
VIF p+ g= TENTA IE L 


nm 
2 a coso 
rdr 


S= f f->rdrap=2a | ap J 75 * 
(8) ry 0 Ya? —r 


n T=aco8 p n 
T Ey 
=2a (— Var) dp = tar | (1 — sing) dp == 2at (Z _ 1). 
0 r=0 
2. To find the area of the intersection of the cylinders (Fig. 53) 
a? -+ y = a?, (27) 
and 

yY tz =g. (28) 


It is more convenient to take y and z as the independent variables in this 
problem, and x as a function of these, given by (27). Integration is over the 
circular area in the YZ plane, with circumference given by (28). The shaded 
area in the figure is evidently 1/8 of the total 
required area, so that we have: 


S= in + p+ g?dy dz, 


where 
dx Yy dg 
Peay ā eT 
Va? + y? a a 
Vite te Si ae 
Fre. 53 © z Yat — y? 
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and hence: 
a Yat—z a a a . 
Y ; a — z 
s=8a [dz f ~H ga | arsin “= ae = 
6 ò pa — y? 0 < 
2=a a 
2 z2 z 
= 8a [zaro sin T= +f aol- 
z=0 @ 
z=a 
= 8y A| = 8a. 
z=0 


63. Integrals over a surface and Ostrogradskii’s formula. The concept 
of a double integral over a plane domain may easily be generalized 
to the case of integration over a surface (curved). Let (8) be a surface 
(closed or open) and F(M) be a continuous function of points of the 
surface. We divide (S) into n parts and let AS,, AS,, ..., AS, be the 
areas of the parts and M,, M,, ..., Mn arbitrary points lying on them. 
We form the sum 


> F (M,)-AS,. 


The limit of this sum on indefinite increase in the number of parts n 
and indefinite contraction of each part AS, is called the integral of 
the function F(M) over the surface (8): 


f {FM jds = lim S F( M,) AS,. 


(S) n>o 


Let a line parallel to the z axis have only one intersection with 
the surface (Fig. 52) and let (ø) be the projection of (S) on theplane XY. 
We can reduce the integral over (S) to an integral over the plane 
domain (oxy) by using (26) for the relationship between an elementary 
area of (S) and of the corresponding — (axy): 


[ra M) ds = I TaT deny, (29) 


We assume here that cos (n, Z) differs from zero and that the function 
F(N) has the same value at the point N of (ø) as the given function 
F(M) has at the point M of (S) whose projection is N. If the equation 
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of (S) is given explicitly by (22), whilst F(M) is expressed as a function 
F(a, y, z) of the coordinates, it is sufficient to substitute z = f(x, y) 
in the expression of F(z, y, z) when integrating over (oxy), ie. F(N) = 

= Fla, y, f(x, y)}. The denominator on the right-hand side of (29) 
is given by de third of equations (24). 

We note that the properties of double integrals listed in [61], 
including, in particular, the mean value theorem, apply to an integral 
over a surface. 

We now prove Ostrogradskii’s 
formula, which is basic in the the- 
ory of multiple integrals, and 
which establishes the relationship 
between a triple integral over a 
volume (V) and the integral over 
its boundary surface (8). We shall 
make the same assumption as in 
[58], that lines parallel to the z 
axis intersect (S) in not more than 
two points. We keep the notation 

Fie. 54 of Fig. 44 [58], whilst intro- 

ducing in addition the direction 

(n) of the normal to (8), (n) being 

assumed taken outwards from (V) (the outward normal) (Fig. 54); (n) 
forms an acute angle with the z axis on the upper part of the surface 
(II), and an obtuse angle on thelower part (I). Hence, cos (n, Z) is ne- 
gative over part (I) and |cos (n, Z)| =— cos (n, Z). Formula (26) gives: 


doxy = cos (n, Z) dS on (II); dozy = —cos (n, Z) dS on (I) (30) 


Let R(x, y, 2) be continuous in the domain (v) as far as the boundary 
(S) and let its derivative dR(z, y, z2)/dz be continuous. 
We consider the triple integral over (v) of the function dR(a, y, 2)/dz. 
We have, on using (16): 


i] SE ie dea (ae f 2B lows) ae 


But the integral of a derivative is equal to the difference in the 
values of the original function at the upper and lower limits: 


ME AREL dy = | | [R (æy, 2) — R (2, y, 2)] doy 


(ozy) 


63] INTEGRALS OVER A SURFACE AND OSTROGRADSKII’S FORMULA 197 


or 


i a ee Y, z) d oy — | [R(zy,%) doy, 


(azy) 

We reduce integration over (oxy) to integration over (8) on replac- 
ing do,, by dS in accordance with (30); the first of expressions (30) 
must be used for the first integral, containing the ordinate z, of part 
(II) of (S) as variable, the result being an integral over (II), whilst 
similarly, the second of expressions (30) is used for the second inte- 
gral and an integral over (I) obtained: 


mS BH (epiti dv = ff Ro bey Shope: Dyas 
(iy) 


+ iv R (zx, y, z) cos (n, Z) dS. 
(1) 
We can drop the subscript for z since it is implied by the part of the 
surface over which integration is carried out. The right-hand side 
contains the sum of integrals over (II) and (I), i.e. amounts to the 
integral over the whole of (8): 


[S [BEL w= f| Rie y,z)cos(n, Zas. (81) 
() (S) 


We might have taken another two functions P(x, y, z) and Q(z, y, 2) 
and shown in a similar manner that 


|f| | [P (x.y, 2) cos (n, X) as 
(v) (Ss) 


{J > a 2 dv= [foa y, z) cos (n, Y) d8. 
@) is) 


We arrive at Ostrogradskii’s formula on adding the left- and right- 
hand sides of the three expressions obtained: 


SY) Cae + ay + ae) 20 = 


ca cos (n, X) + Q cos (n, Y) + R cos (n, Z)] dS. (32) 
9 


The arguments 2, y, z of P,Q and R have been omitted for brevity, 
but it should be borne in mind that these given functions are con- 
tinuous in the volume (v), together with their derivatives. 
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A variety of applications of Ostrogradskii’s formula will be found 
in later chapters. 

The quantities cos (n, X), cos (n, Y), cos (n, Z) are functions 
defined on the surface (9) and have been assumed continuous. A more 
general assumption can be made, that (S) is divisible into a finite 
number of pieces, on each of which the functions are continuous. 

This would apply if (S) were, say, a poly- 
(n) hedron. 

We supposed when deriving (31) that lines 
parallel to the z axis cut the surface (8) of (v) 
in not morethan two points. A generalization 
is easily made for other types of domain. We 
notice to start with that if (S) includes a 
lateral cylindrical part with generators par- 
allel to the z axis in addition to the upper 
Firo. 55 part (II) and lower part (I), we have 
. cos(n, Z) =0 on the lateral part, so that addi- 
tion of this part tothe right-hand side of (31) 
does not change the value of the integral over the surface; the proof 
of the formula therefore remains valid throughout. It is sufficient in 
more general cases to divide (v) into a finite number of sections 
with the aid of cylindrical surfaces whose generators are parallel 
to the z axis, such that the previous conditions are satisfied for each 
section, to which (31) may then be applied. Addition of the results 
obtained will give us the triple integral over the total volume (v) 
on the left-hand side; we shall have on the right-hand side the sum 
of the integrals over the total surface of the sections, whilst the 
integrals over the cylindrical surfaces that have been added will 
be zero, as shown above. The result of adding the right-hand sides 
will, therefore, be the integral over the surface (8) of the original 

volume (v), and (31) is shown to be valid in general. 

We remark that the arguments remain valid when (v) is bounded 
by several surfaces: by one from the outside and by the remainder 
from inside. Figure 55 illustrates the case of (v) bounded by two sur- 
faces. The integration on the right-hand side of (31) is now over all 
the surfaces bounding (v), with the normal (n) directed inwards 
from the inner surfaces [i.e. outwards from (v)]. 

64. Integrals over a given side of a surface. Use is occasionally made of a 


different definition and a different notation for surface integrals. We start 
with the case when (S), illustrated in Fig. 54, satisfies the conditions laid 
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down at the beginning of the previous article. Normals can be drawn in two 
opposite directions at every point of the surface. One direction forms an acute 
angle, and the other an obtuse angle with the z axis. Correspondingly, we can 
distinguish between an upper and a lower side of the surface. Let R(x, y, z) 
be given on (S), as above. We take the integral: 


{Í R cos (n, Z) ds. (33) 
(8) 


The value of the integral depends on the choice of the direction of the normal 
or what amounts to the same thing, on specifying the side of the surface over 
which the integration is carried out. On integrating over the upper side, 
cos (n, Z) > 0, and cos (n, Z) dS = do,,, whilst over the lower side, cos (n, Z) < 
<0, and cos (n, Z) dS = —do,,, where do, is the projection of an element 
of (S) on the XY plane, i.e. an element of the area (a) in (29). We can write 
do, = da dy in (x, y) coordinates, so that integral (33) reduces to an integral 
over (a) in the XY plane: 


{{Rleut@widedy or —STR[eyfi@ydedy, (4) 


depending on which side of the surface the integration is made. These are in 
fact often written in the same way, as 


Si R dz dy, (35) 


whilst indicating which side of the surface is concerned. For instance, if integra- 
tion is over the lower side of (S), (35) amounts to the second of integrals (34). 
We can define (35) directly as the limit of the sum 2R(M;,) 40k, where R(M,) 
is the value of R at the point M, of the element AS; of (S) whose corresponding 
projection on the XY plane is Ao,, the Ao; being reckoned positive if integration 
is over the upper side of the surface and negative if over the lower side. 

We now take the general case of (S), and let M, be a given point of the 
surface. We specify a definite direction for the normal (n) at this point; starting 
from M, and moving continuously over (S), we examine the continuous varia- 
tion in the direction of (n). If any continuous movement leadsto a definite 
direction of the normal at any point of the surface, it is said to be two-sided. 
If we specified differently the direction of (n) at the initial point M , continuous 
movement would lead us to an opposite direction of (n) at every other point 
of the surface. This allows us to speak of the two sides of (S), depending on the 
direction specified for (n) at M, and therefore specified at the remaining points. 
Having fixed the side of the surface, we get a definite value for integral (33), 
which may be written in the form (35) on indicating the side concerned. 

We can similarly define the integrals: 


ff Pdydz and ffQdrdz. 

(S) (S) 
where P(z,y,z) and Q(x,y,z) are functions given on (S). These integrals coincide 
respectively with: 


f f P cos (n,X)dS and f fQ cos (n, Y) dS. 
(S) (S) 
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With these definitions, (32) can now be written as 
oP , 8Q , OR 
JJ) (Geta ee) we ff (P dy dz + Q dz dz + R dæ dy) 
n (5) 


where the integration on the right-hand side is over the outer side of (S). 
We note the existence of one-sided 
surfaces, where continuous movement 


over the surface implies continuous 

\ variation in the direction of the normal 

A | such that, on returning to the initial 

point, the new direction of the normal 

can be the opposite of the original direc- 
tion. A simple example is the Möbius strip, 
&0 obtained by taking a rectangular sheet 
Fia. 56 of paper ABCD, twisting it once and 
fastening edge AB to edge CD in such 

a way that A coincides with C and B 


with D (Fig. 56). Both sides of the ring thus obtained could be painted, without 
removing the brush from the surface, i.e. without passing over its boundary. 


65. Moments. An application of multiple integrals is to the theory of the 
moments of different orders of mechanical systems. Let a system of n material 
particles of masses m,, m,, ..., Mp be given at points: 


My, M s.. Mp 


respectively. 

The k-th order moment of the system with respect to a plane (A), a straight line 
(d) or a point (D) is defined as the sum of the products of the mass of each particle 
of the system with the k-th power of the distance from (A), (d) or (D): 


The zero order moment becomes from this view-point simply the total mass 
of the system: 


n 
m= > Mi. 
i=1 


The first order moment with respect to the plane (4) is called the statical 
moment of the system with respect to the plane. We encounter the statical moments 
relative to the coordinate planes in the expressions for the coordinates of the 
centre of gravity of the system: 


n n n 
> MX D myi D mz 
=l i=l 


i 
naea ty E (36) 
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The distances x;, Yy; Z; from the coordinate planes are reckoned algebraically 
here, i.e. as positive or negative. 

The second order momenis are usually called moments of inertia of the system. 
The expressions 


n n n 
2 
rim; Z yim; XY zim 
i=1 i=1 {=1 
represent the moments of inertia of the system with respect to the coordinate planes ; 
the expressions 


n n 


n 
È itm Melee X elt vim 
= t= t= 


represent the moments of inertia relative to the x, y, z axes; and finally, 
n 
B A++ Dm 


48 the moment of inertia relative to the origin O. 
The expressions 


n n 


n 
Š uiam; D arm; X tiy mp 


i=l i=l i=1 


are occasionally encountered in addition to the above, and are referred to as 
the products of inertia of the system relative to the x, y, z axes. 

If we are dealing with continuous distributions of matter instead of systems 
of a finite number of particles, the above sums have to be replaced by definite 
integrals; these are single, double or triple, according as the distribution of mass 
is linear, over a surface or over a volume; instead of the m,, we now have the 
density f(M) at a given point M multiplied by the 
element of line, surface or volume. 

For example, the moment of inertia of a three-dimen- 
sional domain (v) relative to axis OX is given by the 
triple integral 


s i S (yt +2) M) do. 


If f(M) is the constant density fẹ this can be taken 
outside the integration sign; an f, could be cancelled out 
in (36), leaving functions of x, y and z under the integrals Fra. 57 
in the numerators, and the volume or area of the total 
domain in the denominator. 

Examples. 1. The centre of gravity of a sector of a homogeneous sphere 
(Fig. 57). On using the coordinate system shown in the figure, we need only find 


the ordinate 
§ {Sea 
— 2) 


£ v 


z 
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We have here: 


ax a 


s= fofa pa fede = ana — cos a) = nah 
J § S z do = fap f'sin 0 d0 f'e oos og de = 2a f'sin 0 cos 6 do f'o? de = 
(0) 
=- a(l — cos 2a) 


_ 3 1 — cos 2a 3 3 
of ake | 2 i cosa ~ 8 a(1 + cos a) = -y (2a — h), 


where a is the radius of the sphere. 
2. If the mass is assumed to be distributed only on the spherical surface 
(S) of the sector, the ordinate of the centre of gravity becomes 


feds 
— ($) 
t=, 


8 


where s is the area of surface (S). The equation of the surface is here æ? + y? + 
+ 2 = a? or z = fa? — (x? + y?) and it is easily shown that 

1 z 
+P + 4 


__ dow = n aor 
Jette ff Soa (n, 2) =a | f dos = aa sin? a, 


(dzy) Fay 


cos(n, z) = > 


so that 


where (ayy) is clearly a circle with centre at the origin and radius a sin a. 
The area s will be: 


— i do. 
a= VIF PT èdo =a Í - xy = 
Í xy J Yat — (x? aim y) 


(Gay) 


= ——— = 2a? (1 — 
alae TET 2na? (1 — cos a), 


and finally: 
PEA na? sin? a acos: 2 
£ 27a? (1 — cos a) 2 


We had in the previous example the smaller value for z, : 


3 3 a 
Ea == ro 
3 a(l + cos a) g 2008? —> 
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3. If the centre of gravity is at the origin, all the statical moments are zero, 
as follows at once from the relationships: 


SSS af de = MXg3 
(») 


SSS uf dv = myg; 
w) 


Sf f 2f do = mzg. 
©) 


4. The moments of inertia of a homogeneous right circular cylinder (Fig. 
58) relative to the axis of the cylinder and to the diameter of its central 
section. Taking f, as the constant density, we have: 


2n a 


h 
Jz: = fo | [tr dr dọ dz = 2f, [ap frar | az = nat hf, = ms ; 
0) òo 6 0 


2x h a 
Jx=ffSS (2% + r? sin? g) r dr dp dz = 2, f de Í dz f (2? + r? sin? p) rdr = 
@) o o 0 


2n h a 2n h a 


= 2f, | do [ etaz frar z 2f, | sin? p dp faz fear = nh? a? f,+ 
ô 0 0 6 0 ò 


he a? 


+ Phat fom HE) 


where 2h is the height of the cylinder, a is the radius of its base and m its mass 
5. The moments of inertia of the homogeneous 
ellipsoid: 


x? y? z2 
a tg ta TL 


We let f, denote the density and have, on dividing 
into layers parallel to the XOY plane: 


+e 
Jay = fo | | [ ax dy az = 7, f 2? nab (-5) dz = 
v -e 
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We can change the symbols cyclically to find that 
1 


1 
Tyg = m+ -p J= m: -b 


1 
Jx =I xy + Jy, M+ z +e) 


1 
5 


=m. 


chat) J, =m = (at +b) 


1 
Jy Jy ty tx =m. -zg (a? + bP + e). 


6. The kinetic energy of rotation of a rigid body about an axis (6). 

We know that the velocity V of any point of the body is equal to 
the angular velocity w of the body multiplied by the distance of the point from 
the axis of rotation (6). We find the kinetic energy of the body by dividing it 
into elements of mass 4m and finding the kinetic energy AT of each; then 

T = X AT. 

Since the element Am is small, its mass can be considered to be concentrated 
at any point M of the element; the kinetic energy AT of Am is then 

4p = + V2 Am = at r$ (M) 4v, 


where f( M) is the density of the body at M and r} is the distance of M from the 
axis (ô). Hence we obtain, by definition of a triple integral: 


T= |f [F eiM -y Ja 
(v) 
Ja = Í S f r3 AM) do 
(») 


where 


is the moment of inertia of the body about the axis of rotation (ô). 

Remark. It is sometimes more convenient to evaluate the volume or the 
moments of a body in terms of double or even single instead of triple integrals. 
This comes from the fact that, when a triple integral is represented as the 
single integral of a double integral, or as the double integral of a single integral, 
the interior integral may occasionally be evaluated directly from elementary con- 
siderations without performing an actual integration. This creates the impression 
that a double or single, instead of a triple integral is used for the calculation. 

We can take as an example the moment of inertia Jy, of a body (v) relative 
to the XY plane, where (v) is bounded by the planes z = 0, z = h and by the 
surface formed by revolution of the curve x = f(z) about the z axis. The mo- 
ment can be obtained from a single integral if the body is imagined to consist 
of plane circular dises, parallel to the XY plane; the volume of such a disc is 
a[f(z)}? dz, and we can write: 


h 
Tyy = f 22 [ f(z) F dz. 
(i 
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The same moment of inertia is given by the triple integral: 
h 
J yy = f f f 27 dz dy dz == f 22 dz f f dz dy, 
(2) 0 (2) 


where (¢,) is the section cut from (v) by a plane parallel to the XY plane and 
distant z from it. The interior double integral gives the area (g,), i.e. is equal to 


n{f(z)¥. 


§ 7. Line integrals 


66. Definition of a line integral. Let (l) be a curve with a de- 
finite direction in space (Fig. 59), A being the initial point and B 


We 
eee 
xs 
ò 
w 
w wt 
(9 
H2 
Mı 
eyo ks 
AN 1 
Fic. 59 


the end-point of the curve. We shall measure the length of arc along 
(I) from the initial point A. Let a continuous function f(M) be given 
on (l), and let (1) be divided into n parts by the points Mo, M,,..., 
Mn- Mn, where M, coincides with A and M, with B. We take any 


point N, of each segment Mpy Mp4, (k = 0, 1,..., n — 1) and form 
n-l 

the sum © /(N;) As;,, where As, is the length of arc My M;,4, of (Ù). 
k=0 


The limit of this sum on indefinite increase in the number n of di- 
visions and indefinite decrease of each of the Mp Mpy is called the 
line integral of the function f(M) over (l) and is written: 


n=l 
ql een SAA (1) 
k=l 
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The position of the variable point M on (l) is fully defined by the 
length of arc s = AM, and f(M) can, therefore, be taken as a function 
of the independent variable s, i.e. {(M) = f(s); integral (1) is an ordi- 
nary definite integral: 


fads = f f(s) ds, 
i ò 


where / is the length of arc of (l). We note that (J) can be a closed 
curve, i.e. B can coincide with A. 

So far, we have not used the fact that (1) has a specified direction, 
which becomes important later. Let the coordinates of M be (x, y, 2), 
referred to a system of rectangular axes in space. Let P(z, y, 2) 
be a continuous function along (l), and let (k, nr Čr) denote the co- 
ordinates of Ny, and Ax, the projection of the directional arc M, M4, 
on the x axis; Ar, can of course be positive, negative, or zero. We now 
form the sum of the products of P(N;) = P(&, Ng, Sx) with Az, instead 
of Asy, i.e. the sum 


~ 


n— 


D P(E Me Er) AX: 


k=0 


The limit of this sum is called the line integral of P(x, y, z) over (l) 
and is written 


n=l 
f P(x, y, 2) da => lim > P(é,, Nhs E) Ax,. 
@ k=0 


The integrals 
{ Q(z,y,z)dy and f R(x, y, z) dz 
® © 


are similarly defined, where Q(z, y, z) and R(x, y,z) are continuous 
functions along (l). Addition of these three integrals gives us the gene- 
ral form of the line integral, which is written: 


{ P(x, y, 2) da + Q(x, y, 2) dy + R(x, y, z) dz. (2) 
@ 


Integral (2) is by definition the limit of a sum of the form: 


n-1 
> [P (Ek No r) AX, + Q (1 No Èr) MY + E (Er Nw fy) Az], (3) 


k=0 


where Ayk, Az, are the projections of Mp Mpy on the y and z axes 
respectively. The connection between integrals of type (2) and type (1) 


66] DEFINITION OF A LINE INTEGRAL 207 


is easily found. The coordinates (x, y, z) of the variable point M of 
curve (J) can be taken as functions of the length of ares =U AM. 
As we know from [I, 160], the derivatives of these functions are the 
direction-cosines of the tangent to (J), i.e. 

da 


de = 008 (t, X); W cos (t, Y); ge L o (t, Z), 


ds ds 

where ¢ is the tangent to (l) at M, directed in the same sense as (l). 
The symbol of the type (a, 8) denotes as usual the angle between the 
directions a and £; the cosine of the angle is independent of the sense 
in which it is measured, and the sense is not specified here. To an 
accuracy of higher order infinitesimals we can take: 


Ax, = cos (ty X) As, Ay, = cos (t,,¥)As,, Az, = cos (t,, Z) As,, 


where ¢, is the direction of the tangent at Nx, and integral (2) reduces 
as the limit of the sum (3) to form (1): 


| Pde + Qdy+ Rdz= 
È 


we 


=f [P cos (t, X) + Q cos (t, Y) + R cos (t, Z)} ds, (4 
(9) 


where P, Q, & can be considered as functions of s along (I). 

Let the equation of (l) be given in parametric form: 

r= p(t); y=y(t); z= a(t), (5) 

where the point (x, y, z) describes the curve (l) from A to B as t varies 
from ato b. We shall suppose functions (5) continuous, with continu- 
ous first order derivatives, in the closed interval (a, b), whilst taking 
a < b for clarity. 

Let t = Tp correspond to Mp, and let us consider the first of the 
summations (3). Let t = qth correspond to the point (k, Nr, ¢,) of the 
curve. We can write, using the formula of finite increments [I, 63]: 


Ax, = P (T41) — P (Tr) = P' (Tk) (Tra — Tr)» 
where qty is a value of t in the interval (Tp, t,4,). Our sum can now be 
written as: 


n-i 


D P (Eko No Cr) £y = 


n—1 
=> Pig (ti), Y (Tk) @ (ti) P (Th) (Tki — Th) (6) 
k=l 
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This closely resembles the sum: 
n=l 
o= X P [p (th), v (Ti), © (TO) P(T) (Thar — Ti), 
k20 


which tends in the limit, as the greatest of the (Tk+4ı — tx) tends to 
zero, to the definite integral: 


b 
$ Pip (1), v(x), w(t] gp (tdr. (7) 


We now show that the difference between o and (6) tends to zero. 
It will follow immediately from this that (6) has a limit, equal to 
integral (7). The difference is of the form: 


n-l 
n= > {Pl (ti), v(t), © (%)] — 
k=0 
— P [p (th), Y (TK), @ (xh) ]} QP’ (Th) (Tra — Ty): 
Since tý and tg belong to (Tk, Tk) and P[9(zt), p(t), œ(t)] is uni- 


formly continuous, for any small positive € there exists a ô such that 
[I, 32] 


|P [p (ti), (ti), © (t)] — P [p (th), v (th), oH] < e, 
provided (Tk} — Tk) < 6. The absolute value of 7 will then satisfy 


n-l 
|n| <e > |p (th) | (Tkt — Ta). 
k=0 


The function y’(t) is continuous and therefore bounded in (a, b), 
i.e. | y’(t) | < K, where K is a definite number [J, 35]. Hence we have: 


n=l 
|n| <EK © (Trt — Th) = eK (b — a).' 
k=0 


Since e— 0 if max (Tk4ı — Tk) —> 0, it can be seen that 7 in fact 
tends to zero, and (7) is the limit of summation (6). The remainder of 
summations (3) can be considered in the same way and hence we can 
show that, with the assumptions made, integral (2) can be written as 
an ordinary definite integral: 


[Pdr +Qdy+Rde= f[Po (2) + Qy 0) +Ro'(]dr, (8) 
0) a 


where P, Q and R must be expressed in terms of t in accordance 
with (5). 


67] WORK DONE BY A FIELD OF FORCE. EXAMPLES 209 


Several of the properties of simple integrals listed in [I, 94] may be 
generalized at once for line integrals. For instance: 
I. If the curve (l) consists of distinct portions (L), (l), ..., (lm), 


{Pde + Qdy+ Rde= JIP dr +Q dy + Ede] + 
(0) h 
+ i [P dz +- Q dy + Rdz] +... + J P de + Q dy + Bde}. 


IL. The value of a line integral depends on the direction of the 
curve (l), as well as on the form of (l) and the integrand; however, 
when the direction of the curve of integration is changed, ihe integral only 
changes sign. 

If (l) as a whole does not satisfy the conditions stated above, but 
can be divided into a finite number of sections, each of which has 
parametric equations (5), we can apply (7) to each section, and the 
integral over the whole curve can be written as the sum of the integrals 
over the separate sections, It may easily be shown that this is equi- 
valent to the limit of summations (3) for the whole curve. We shall 
only consider in future curves (l) which satisfy the conditions just 
mentioned. Finally, we notice that if 7 is the length of are s =_ AM, 
formula (8) reduces to (4). 

If (2) is a plane curve, say in the XY plane, (2) becomes: 


{ Pdz + Qdy, 
® 
where P and Q are functions of (x, y) given along (I). 


67. Work done by a field of force. Examples. Calculations of work 
done lead naturally to line integrals of type (2). Let a point M de- 
scribe a trajectory (J) under the action of a force F, which is a function 
of points along (l). The work done is found by dividing (l) into small 
segments and considering one of these, say My, Mk. In view of the 
smallness of the segment, F can be reckoned to maintain constant over 
the segment its value at Mp, whilst are UM, Mp4; can be replaced 
by chord Mp Mr}. The work done over the segment is thus given 
approximately by 


AE, ~ | F| | MM pya | cos (Fy, MyM p41), 


where | F, | denotes the length of vector F at My, | My Mp4 | is the 
length of M,M;,4,; and AE, is the work done over the segment 
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UM,My4,. On using the formula of analytical geometry for the 
angle between two directions, we can write: 


AE, ~N | F, | | M,M p41 | [cos (E, X) COS (M, M y41 X) + 
+ cos (Fp, Y) cos (M, Mpy Y)cos(F,, Z) cos(M,My,41, Z)], 


or, on removing the brackets and denoting the projections of F on 
the coordinate axes P, Q, R: 


AE, ~ Py Avy + QAY, + Brzy 


where the subscripts of P, Q, R indicate that their values are taken 
at Mp. We now sum over all the segments and pass to the limit, so 
that the work done is accurately: 


E = Í P dz + Qdy + Rdz. 
a) 


Examples. 1. The work done by the constant force of gravity when a point 
M of mass m is displaced via any curve (l) from the position M,(a,, bis c,) to 
M,(a;, bz, ¢2) is given by the integral 

Cy 
{ Pdx+Qdy + Rdz = f mgdz= mg (c,—¢,) 
0) Cy 
(the z axis is directed vertically downwards); hence it follows that the work 
depends only on the initial and final positions of the point, and not on the path 
traversed. Here we have an example of a line integral the value of which 
depends only on the initial and final points of integration, and not on the path. 

2. The work done by the gravitational force when a unit mass is displaced 
from the point M, to M, in its attraction to a fixed centre of mass m. If we 
take the centre at the origin and let r be the radius vector to the point, F can 
be seen to be in the opposite direction to OM and equal to fm/r?, where f is the 
gravitational constant. Hence we have here: 


fm r, __ m y ; _im_ z 
Sra.) Qema se fT ae Sect 
E=—f sår ty eu tee a fm [rr] a l 


9 (9) 


and if r,, r, denote the respective distances of M, and M, from the gravitational 
centre, 


and here the work, i.e. the corresponding line integral, depends only on the 
initial and final points, and not on the path. 
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If we introduce the potential of the point mass, 


_ fm 
mo 
so that 
__9U __9U | _9U 
T ey’ Q Oy’ Oz’ 


the work E is given by the difference in the values of U at M, and M,, ie. 
E=U(M,)—U(M)). 
We consider line integrals over plane curves in the next examples. 


3. We consider the established plane flow of an incompressible fluid of con- 
stant density which we take as unity. With this, the velocity of motion v 


y 


Fra. 60 


of a fluid particle situated at M(x, y) depends only on (x, y). We find the quantity 
of fluid g flowing in unit time past a given contour (l) (Fig. 60). Let u and v 
denote the projections of v on the coordinate axes. We take an element VMM’= 
= de of (l), and assume that the velocities of all the particles of the element 
are approximately equal; it can then be seen that, during a small interval of 
time dt, all the particles move an amount | v | dż in the direction of v to the 
position NN’. The area of the parallelogram MNN’M’ can be written as the 
product of the base ds and the projection of v dé on the outward normal (n) 
to (l), i.e. 
area MNN’‘M’ = |v | cos (v, n) dids, 


where | v | is the length of v. If (s) is the direction of the tangent to (l) on a 
counter-clockwise circuit, we have: 


(n, X)= (s, Y); (n, Y) = (8, X)— 7, (9) 
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where a symbol of the type (a, 8) denotes the angle measured counter- 
clockwise from direction a to 8. Hence: 


cos (n, X) = cos (s, Y) and cos(n, Y) = — cos (s, X). 
But we know that the angle between two directions is given by 
cos (v, n) = cos (v, X)cos (n, X) + cos (v, Y) cos (n, Y), 
so that by (9): 
cos (v, n) = cos (v, X) cos (8, Y) — cos (v, Y) cos (s, X). 
On substituting in the expression for the area and noting that 
|v | cos (v, X) = u; | v| cos (v, Y) =v 
ds cos (s, X) = Az; ds cos (8, Y ) = Ay, 


we get finally: 
area MNN’M’ = (— v da + u Ay) dt. 


If (v, n) is an obtuse angle here, cos (v, n) will be negative and an area will 
be obtained with a negative sign, which corresponds to the case when the fluid 
flows into the domain bounded by (i). 

The total quantity of fluid flowing past (2) in time dt becomes 


dt Z (~v4e tud) = at f —v dr + udy, 


and the quantity in unit time is 


q= Í [—vda + u dy), (10) 
(0) 


with the circuit round (1) taken counter-clockwise. We notice that (1) can 
be closed. The quantity q given by (10) has the (++) sign if the flow is in the 
direction of the normal (n), and the (—) sign if in the opposite direction. 

We remarked above on the direction of (n); it is bound up with the direction 
of the integration round (l) and with the orientation of the x and y axes in 
accordance with (9).If (1) is a closed contour and integration is carried out counter- 
clockwise (Fig. 60), g gives the difference in the quantities of fluid flowing in 
and out of the domain bounded by (l) in unit time, where either term of the 
difference can be missing. 

If (2) contains neither sources (which produce fluid: positive sources) nor 
sinks (which absorb fluid: negative sources), q must be zero, since otherwise 
the quantity of fluid inside (J) would increase or diminish, contradicting the 
properties of incompressibility and absence of sources. 

It follows that the established plane flow of an incompressible fluid is character- 
ized by the equation 

i [— vdr + udy]=0. (11) 


which must be satisfied for every closed contour (l) not containing sources. 
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4. The state of a body in thermodynamics is given by three physical magni- 
tudes: the pressure p, volume v and temperature T (absolute). These are con- 
nected by a relationship of the type 


f(x, p, T) = 9; 
in the case of an ideal gas, for instance, we have Clapeyron’s equation: 
pv— RT =0. 


The state of a body is thus defined by two of the three magnitudes, say by 
p and v, i.e. by the point M(p, v) on the pv plane. 

If the state changes, the point M must describe a curve on the pv plane, 
which is called a diagram of the process concerned; if the body returns to its 
original state, we speak of a cyclical process or cycle, and its diagram is a closed 
curve (J). 

We find the quantity of heat Q absorbed by a body during the process by 
dividing this up into small elementary processes in which small changes Ap, 
Av, AT occur in p, v, and T. If only one of these magnitudes changes, the quan- 
tity of heat absorbed is roughly proportional to the increment of the correspond- 
ing variable, whilst if all three change together, the total increment AQ is 
equal to the sum of the component increments, by the principle of addition of 
small operations [I, 68]. In other words, we have an approximate equation of 
the form 

AQ~AAp+ BAv+CAT 


and we get finally: 
Q= YAQ=SAdp+Bdv+Cadr. (12) 


On using the equation of state to express 7 in terms of v and p, we get: 
a : r OP ay: 
T = 7 (v, p); i e oe 


and finally, on substituting these values for T and dT in the right-hand side 
of (12), we find: 
Qe) a a 
( 


where P and V are known functions of v and p. 

5. Lot us consider the expansion or compression of the gas (steam) in the 
cylinder of a gas (steam) engine. The change in volume Av is proportional to 
the displacement of the piston in the cylinder due to the pressure p, so that the 
work AE done by the pressure in producing the change in volume is given by 
pAv with suitable choice of units; hence the total work done during one cycle is 


E= \ pdv. 
d 


68. Areas and line integrals. We find the area ø of the do- 
main (c), situated in the XY plane and bounded by the closed 
curve (l). We assume for simplicity (Fig. 61) that a line parallel to 
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the axis OY cuts (l) in not more than two points. Let y, and y, be 
respectively the ordinates of the points of entry into and exit from 
(c) of a line parallel to OY, and let a and b be the extreme values of 
the abscissae of points of (l). We 

y have [I, 101]: 


b 
o= f Ye -- Yı) dx. 


Let the so-called points of entry 
lie on section (1) of the curve, and 
similarly, those of exit on section 
(2). The integral: 


b 
Fic. 61 f Y, dx 
a 


is identica] with the line integral 

f ydg, 

2) 
with direction from the point x = a to x = b, and taken with the 
reverse sign. Similarly, 


b 
f yı da, 
a 
is identical with 
{ y dz, 
(i) 
taken from z = a to x = b. 
Finally we have: 
b b b! a 
o=fyde— fudem—| f yde + f yde|=— f yds, 03) 
a a Oai (2)b (U) 


where (l) runs in a counter-clockwise direction. 
We find by a similar method that 


o = f xdy. (14) 
Ò 
We also get, on adding and dividing by two: 


ae { ady—ydz. (15) 
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We deduced (13) by assuming that a line parallel to OY cuts (l) in 
not more than two points. The formula can easily be shown to be 
justified for more general contours. We first take the case when (øo) 
is bounded by curves (1) and (2) and by twostraight segments parallel 
to OY (Fig. 62). We find on going through the same argument that 


o= -| { ydr + ( yaz]. 
à) È) 


But z is constant over CD and BAand dz = 0, so that Sy dz is zero 
over these sections. We can add these integrals with the minus sign 
to the right-hand side and thus obtain (13) for the case in questions. 
When (c) has the more general type of contour (1) of Fig. 63, we pro- 
ceed as follows. We draw straight lines parallel to OY so as to divide 
(o) into a finite number of parts, to each of which (13) is applicable; 
addition of the expressions obtained gives us on the left-hand side 
the area o of the total domain, and the integral over (/) on the right, 


Fie. 62 Fic. 63 


the integrals over the additional contours being zero as above. Thus 
(13) is valid of the domain, and (14) and (15) may be justified for 
contours of general type in the same way. 

In the case of the ellipse 


x=acost; y= bsint (0 <t < 27) 
(15) gives: 
] Qn 


2n 
=> f (a cost- b cost + bsin t-a sin t) dt =~ ab i dt = xab. 
0 0 
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It is essential in the above formulae for an area for the circuit round 
(l) to be taken counter-clockwise, or more strictly, to be taken in the 
same direction as the 90° rotation needed to make OX coincide in 
direction with OY .If OY is directed downwards instead of upwards, the 
formulae remain in force provided a clockwise circuit is taken round 
(1). We shall always assume the above condition for the direction 
round a closed contour in a plane. 


69. Green’s formula. We now establish a fundamental formula 
relating an integral over a closed surface toa line integral over 
the contour of the surface. We start with the case when the surface is 
a plane domain, when we obtain what is usually known as Green’s 


formula. 
We use (7) of [56] to evaluate the double integral: 
f OP (x, y) do 
oy : 


(0) 
where P(x, y) is a function of (x, y). 
We integrate first with respect to y and assume that the contour (i) 
of (c) is intercepted at only two points by lines parallel to OY (Fig. 61); 
this gives us: 


bY: 
fa = Jfa da dy a a & dy = 


1 


b 
= fie (x, Y2) — P (x, 41)] dz. 


On the other hand, the integrals: 
b a 
f P(x, Y) dz, f P (z, Y2) da 
a b 


are the same as the line integrals: 


f P (x,y) de, 


taken respectively over parts (1) and (2) of contour (l) from the point 
z= qa to x =b. 
We get on changing the direction of integration of the second integral: 


b a a 
f P(%,y)du=— ù P(x, yds =— f P(x, y)dz, 
a ò (2o 
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whence 


a b 
oP 
f y L = — Í P(æ,y)de — | P (æ, y) de, 
(0) (2)b (ja 


or 


SfE — | Pads. (16) 
(0) (D 


where the circuit round (l) must be taken counter-clockwise (Fig. 61) 
We find in the same way the integral: 


J [O ao, 


where Q is another function of (x, y). Assuming for simplicity that (/) 
is intersected in only two points by lines parallel to OX, we have: 


B Xs 
[a= [J ae ay = Jax f 3 ar = 


B 
= | [Q(e,¥) — Q (a, y)] dy, 


and this can be similarly reduced to a line integral over a closed 


contour: 
3Q = 
IEZ = faas. (17) 


Subtraction of (16) from (17) gives us Green’s formula: 
JJ (32 -- $y) de = | Paz + aay, (18) 


We deduced (16) on the assumption that lines parallel to OY cut (l) 
in not more than two points. The same arguments as in the previous 
article can be used to show that the formula is justified for any type 
of contour. Similar remarks apply as regards (17) and (18). 
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The discussion is applicable to the case when (ø) is bounded by 
several curves (Fig. 64). The integration on the right-hand side of (18) 
is now over all the curves that bound the domain, in a counter-clockwise 
direction for the outer contour (with the axes directed as shown) 
but clockwise for the inner contours, i.e. so that the domain (ø) lies 

on the left over each contour. 
y We note that Green’s formula (18) 
can be written in another way. Let t 
be the tangent to l, taken in the same 
direction, and let n be the normal to J, 
. taken outwards from ø. The direction of 
t is found by turning n counter-clockwise 
through a right angle, so that we have 
for the angles formed by ¢ and n with 
the axes: (t, X)=n + (n, Y) and (t, Y ) = 
= (n, X). If ds is an elementary arc of the curve, dæ = ds’ cos (t, X) 
and dy = ds- cos (t, Y), so that dx = — ds: cos (n, Y) and dy = 
= ds: cos (n, X). If we substitute these in (18) and replace P by 
—Q)and Q by P, we get: 


Fie. 64 


J) e+ 2) do = JIP cos nx) + Q cos (x, ¥)] ds. 


Green’s formula in this form amounts to Ostrogradskii’s formula 
for a plane. 


70. Stokes’ formula. We now turn to the case of any non-closed 
surface (S) with contour / (Fig. 65). We retain the notation of [62], 
and assume that lines parallel to the z axis cut S in a single point. 
The projection of l? on the XY plane gives the contour (A) of the domain 
(oxy). We take the counter-clockwise circuit round (A) as positive, and 
similarly for (l). The direction n of the normal to S is taken so that it 
forms an acute angle with axis OZ and cos (n, Z) > 0. With this, the 
lower sign must be taken in expressions (24) of [62], which give: 


pcos (n, Z) = — cos (n, X); geos(n, Z) = —cos(n,X), (19) 


whilst (26) of [62] can be written as: 


do,., = cos (n, Z) d8. (20) 
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Let P(x, y, z) be a given function in the neighbourhood of surface 
(S) which is continuous and has continuous first order derivatives. 
We consider the integral: 


Í P(a,y, 2) da. 
(1) 


Points of (1) lie on (8) and therefore satisfy its equation: z = f(z, y), 
and we can replace the z under the integral sign by f(x, y). The inte- 
grand, P[x, y, f(z, y)], now con- 
tains only x and y. The coordi- 
nates (x,y) of a variable point 
of (A) are the same as for the 
corresponding point of (l), and 
integration round (l) can be 
replaced by integration round (A): 


| PEGA aes 


= J? le y, Í (x, y)] dz. 


We apply Green’s formula (18) Fic. 65 
to the integral on the right, with 
P= Pia,y, f(x, y)], Q = 0, and 
(1) becoming (A) in this case. We first find 0P/dy by differentiating P 
both directly with respect to y and via the third argument z, which 
we replaced by f(x, y): 


oP OP (a, y, 2) 
oy dy n 


ðP (x, Y, 2) T of (x, y) 
Gz ay ’ 


where f(x, y) is to be understood for z in the expression for P. We now 
have from (18): 


[Pi (x,y,z a de= | P [ey f(ey)] do = 
(0) 


Pf oP(a, y, 2) OP(a, y, 2) Of(@, y) 
(| ey + ð ` əy Jaen. 


(zy) 
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On expressing do,, in terms of the element dS of S in accordance 
with (20), we transform the double integral to an integral over (8) [63]: 


| Pe, y, 2) dx = 
(9) 


a yY?) Pe: y2) Ae y) 
E | [ + Fa | cos (n, Z) dS, 


and obtain finally, on using the second of i ae (19): 


Pdx = {{ p coa(n, Y) — £ cos (n, z)] dg. (21) 
(0) (S) 


If we are given two other functions, Q(z, y, z) and R(x, y, z), in the 
neighbourhood of (8), we can change the variables x, y, 2 cyclically 
to get two analogous expressions: 


[Ray = {I [2 cos (n, Z)— FB cos (n, X)| as 
(9) 


E ete X) — ŻE cos (n, Y) |as. 
0 (S) 


We arrive at Stokes’ formula by adding these three expressions: 


[Pi Ody + Raz = MEG X)+ 


4 (2 = 3E) cos (n, ¥) + ($2 — 3) cos (n, Z)|d8. (22) 


The formula relates the line integral round the contour of a sur- 
face to the integral over the surface itself, and bears some similarity 
to the Ostrogradskii formula of [63], which relates the integral over 
the surface of a three-dimensional domain to the integral over the 
domain itself. Green’s formula is the special case of Stokes’ formula 
when (8) is a plane domain in the XY plane; (l) now becomes a closed 
curve in the XY plane and dz = 0; (n) is along the z axis, so that 
cos (n, X) = cos (n, Y) = 0 and cos (n, Z) = 1. We obtain (18), on 
making these substitutions in (22). 

The same assumptions are made regarding the cosines in (22) as 
when deducing Ostrogradskii’s formula [63]. 

We arrived at (21) on the assumption that a straight line parallel 
to the z axis cuts (3) in a single point. If this is not the case, we divide 
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(S) with the aid of supplementary curves so that our condition is 
satisfied over each portion of (S), and (21) is applied to each portion 
in turn; addition of the results gives us the integral over (l) on the left, 
since the integrals are taken twice in opposite directions over the 
supplementary contours and cancel out; the double integral over the 
total surface (S) is obtained on the right. Hence, (21) is justified in the 
general case. The same arguments apply for the general formula (22). 
It is only necessary to observe the following condition here for the cir- 
cuit round (l) and the direction (n) of the normal: an observer travelling 
round (l1), with his height along the direction of the normal (n), must have 
the surface (S) on his left. This rule is bound up with the choice of the 
system of coordinates shown in Fig. 65. In this system, the observer 
with his height along OZ, sees OX become OY by a counter-clockwise 
rotation through an angle of 2/2. If the rotation were clockwise, 
“on his left” would have to be replaced by “on his right” in the above 
rule. 


If the notation of [64] is used for the surface integral, (22) may be written in 
the form 


[Pax + Qdy + Rdz = 
(9 


-JEE -e ovat (EB) cess 
(S) 


+ E = a) da dy. (23) 


The side of the surface (S) and the direction (n) are defined in accordance 
with the rule above. 


71. Independence of a line integral on the path in a plane. 
The examples in [67] showed that the value of a line integral is 
sometimes independent of the path of integration, though this is not 
always the case. We now use Green’s and Stokes’ formulae to show 
the conditions for independence of the integral on the path, and start 
with the case of a plane curve and the condition for independence of 


(B) 
4 Pdr + Qdy 
(A) 
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on the path. We join the points (A) and (B) with curves (1) and (2) 
(Fig. 66), when we must have 


(B) (B) 
(1) f Pdx+Qdy = (2) | Pde +Q dy, (24) 
(A) (A) 
or, on using property II of [66]: 
(B) (8) 
G) f Pdx+Qdy—(2) { Pdr-+Qdy=0, 
(A) (A) 


(B) (A) 
1) J Pdx + Qdy + (2) J P de +Q dy = | P de + Q dy = 0, (25) 
A B (I 


where (1) is a closed contour, formed by curve (1) taken from (A) to (B) 

and curve (2) taken from (B) to (A). 
y We see from this that the integral 
over any closed contour (l) must be 
zero, in view of the arbitrariness of 
points A and B. Conversely, if the 
integral over a closed contour (l) is 
zero, the integral over (1) is equal to 
the integral over (2), since (24) follows 
conversely from (25). If curves (1) 
and (2), that join A and B, intersect, 
we join A and B with another curve (3) 
that cuts neither (1) nor (2), and use the equations 


(8) (B) 
() f Pdz+Qdy=(3) | Pde + Qdy 
(A) (A) 


Fic. 66 


(B) (B) 
(2) { Pda + Qdy = (3) { Pda + Qdy 
(A) (A) 


to obtain: 
(B) (B) 
(1) | Pdr+Qdy=(2) | Pdz + Qdy. 
(A) (A) 

It follows that the condition for independence of the integral on the 
path is the same as the condition that the integral over any closed contour 
(1) vanishes. 

If the latter condition is satisfied, we obtain from (18): 


MOa 4° = 0 (26) 


where the domain of integration (ø) is entirely arbitrary. 
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We now prove the consequence, that 


3 aP 
a oy (27) 


is an identity, i.e. true for all x and y. 
Suppose that at a point O(a, b), 


differs from zero and is, say, positive. By the assumed continuity of 
ƏP/Əy and 3Q/da, f(x, y) must be positive over some small circle (o4) 
with centre C. We form the integral 


[freni [ER -Ear 
and apply the mean value theorem [61] to it: 


[E -Ea seme 


where (&, 7) is a point of (c,), so that f(&, n) > 0; hence it follows that 


[JCS - Fyeoo 


which contradicts the fact that integral (26) vanishes for any choice 

of (a). Condition (27) is thus necessary for the independence of the 

integral on the path. It is easily shown to be sufficient, since it follows 

from this, by (18), that [Pdz + Qdy vanishes for any closed contour, 
(Q) 


which is the same as the independence of the integral on the path. 
Condition (27) is thus necessary and sufficient for the integral 


(B) 
{ Pdz +Q dy (28) 
(A) 
to be independent of the path of integration, and to be a function of the 
coordinates of points A and B only. 
If this condition is satisfied and we fix the point A(x, Yo) whilst 
only the point B(x, y) varies, integral (28) is a function of (x, y), i.e. 
a function of the point B: 


"Bier Cape. (29) 


(Xos Yo) 
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We consider the properties of this function. On keeping y constant 
and giving x an increment Az, we get: 


(x+ 4x, y) (x,y) 
U(x + Az,y)—U(z,y)= f Pdxe+Qdy— f Pdr+ Q dy. 


(Xo, Yo) (Xo, Yo) 


Since the integral is independent of the path of integration, we can 

take the path for the first integral 

y as made up ofthe curve AB (Fig. 

67) as in the second integral, 

together with the straight line 

BB’. The integrals over AB cancel 
out, and we are left with: 


B/x,y) Bix+Ax,y) 


(x+ 4x, y) 
(x,y) 


x+4x 


Fie. 67 = f P(x,y)dx, 


x 


since y is constant on BB’, and dy = 0. We apply the mean value 
theorem [I, 95]: 


U(x + Ax, y) — U(x, y) = AxP(x + OAz,y) (0<6< 1). 


We divide by Az and let Ax — 0: 


ST = lim P(x + 042, y) = Pla, y). (30) 
E Ax-+0 


It can be shown in a similar manner that: 


ðU 


We find from (30) and (31) that [I, 68]: 
3U ðU 
dU = Oe dz +- dy = Pde + Q dy. 


Hence, if condition (27) is satisfied, the integrand 


Pdz + Qdy (32) 
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is the total differential of the function U(x, y) defined by (29). It is 
easily seen that the most general expression for the function U,(z, y), 
whose total differential is (32), is given by 
where C is an arbitrary constant. We have, in fact: 

dU = P dz + Q dy, 

dU, = Pdz + Q dy, 


whence 
d(U, — U) =0. 
If the differential of a function is identically zero, its partial deriv- 
atives with respect to the independent variables must be zero, and the 
function itself is therefore a constant, i.e. 


U,—U=0, 


which is what we wished to prove. 
We notice the obvious equality, valid when (27) is satisfied: 


8) (8) 
f Pde+Qdy= f dU, =U,(B)—U,(A). (34) 
(A) (A) 
Conversely, let U, exist such that: 
dU, = P dx + Qdy. (35) 
We show that now: 
aQ ə 
w ay TO 


and that U, is given by 


(x,y) 
Us(a,y) = f° [Pdr + Qdy] +0. 


(Xo Yo) 
Equation (35) can be written as 
U, 


ou, ə 
Oa dz + oy 


and since the differentials dz and dy of the independent variables are 
completely arbitrary [I, 68], it can only be satisfied when the coeffi- 
cients of dz and dy are the same on both sides, i.e. 


bo 
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whence it is clear that: 


aP U, SU, Q 


Oy + Ərxəðy —— Əyðææ ðr 
Hence (27) is fulfilled, and the integral 


(x,y) 
U(x, y) = f P dx + Q dy 
(Xo, Yo) 
is now, by the previous discussion, only dependent on (x, y) and has 


the property that: 
dU = Pde + Q dy = dU., 


whence it follows that: 
Ui a U + O, 


which was required to be proved. We can thus say: a necessary and 
sufficient condition for P dx + Q dy to be the total differential of a function 
U, is given by the identity: 


oP 3Q 
Oy “On? 
U, being then given by: 
(x,y) 
U,(%,y)= | Pdx+Qdy+C. (36) 


(Xo, Yo) 


72. Multiply connected domains. The proof of the necessity and 
sufficiency of (27) for the line integral 
(B) 


{ Pda +Qdy 
(A) 


to be independent of the path is based on the following two essential 
assumptions: 

(1) Functions P and Q and their first order partial derivatives must 
be continuous in the domain of variation of (x, y). 

(2) Whatever closed contour (l) is drawn in the domain, the part 
of the plane inside (i) belongs to the domain where (27) and the con- 
tinuity condition are satisfied. 

The first condition is important because the functions mentioned 
come under the integral sign in the proof. The second is needed for the 
application of Green’s formula, i.e. for the transformation from a 
line to a double integral. It is equivalent to the fact that any 
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closed contour traced in the domain can be contracted continuously 
to a point without moving out of the domain, or to put the matter 
more simply, the condition amounts to the domain having no gaps. 
We now suppose that P, Q and their derivatives are continuous and 
that (27) is satisfied in a domain (ø) with two gaps or holes (Fig. 68). 
If a closed contour (l) with no holes in its interior is taken in the 
domain, Green’s formula (18) is applicable to the contour and its 
interior, and by (27), the inte- 
gral round (/,) must be zero. We 
now take a closed contour (4) 
containing hole (I). We cannot 
apply (18) here, and in general, 
integral (28) round (l) does not 
vanish. We show that the value 
of this integral is independent of 
the form of (l), the only impor- 
tant factor being that the con- Fie. 68 
tour encircles the single hole (I). 
Let (L) and (l) be two contours encircling (I). We have to show that 
integral (28) round (7,) is the same as round (l). We join (L) and (l) 
by means of the extra contour (ab), so that the three together form a 
closed contour without a hole in the domain; the circuit round the 
total contour should be as shown by the arrows. We can apply (18) 
to the total contour and the integral round it must vanish, by (27): 


{+ f+ +f =o 
CO) (ba) C7) (ab) 


The integrals over (ba) and (ab) are taken in opposite directions and 
cancel out, whilst the integration is clockwise round (l) and counter- 
clockwise round (/,). We can change the direction of integration round 
(L) providing we change the sign in front of the integral, so that we get: 

‘ae 
ow) Ow 


or finally: 


f Pdr +Qdy = | Pdx+Qdy, 


ow) Ou) 


i.e. the integrals round (/,) and (l) are equal, provided they are both 
taken counter-clockwise in the usual way. It follows that hole (I) 
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corresponds to a definite constant œ, equal to the value of integral (28) 
taken over any closed contour encircling (I). Similarly, hole (IX) corre- 
sponds to another constant w. 

If we make two slits (ab) and (cd) in a domain (D) from the holes 
to the outer edge (Fig. 69), a new domain is obtained without holes, 
and by (27), we can form the single-valued function in the domain: 

y) 


U(z,y)= | Pde+Qdy; 


(Xo Yo) 


but, by the above, the value of the function changes by a constant œ, 
on crossing (ab), and by œ, on crossing (cd). If we remove the slits 
and return to the original do- 
main (D), the function U (x, y) 
is many-valued. Circuits round 
the holes imply adding w, and 
w to the function, i.e. U(x, y) 
includes an arbitrary term of the 
form m, œ + Mm, œ, where m, 
and m, are arbitrary integers. 
Our arguments are obviously 
all applicable to any number 
of holes in a domain, inelud- 
ing the case when a hole con- 
sists of a single point. The degree of connectivity of a domain (D) 
is defined as the number of holes plus one; the domain with holes 
is said to be multiply connected. Constants œ, and œ, are called the 
circulations of P dz + Q dy or the cyclical constants of U (x, y). 


Fia. 69 


Example. We take the function 
= we 
gy = arc tan et 


given in a domain (D) bounded by two concentric circles with centre at the 


origin. 
We define P and Q by: 
Bs Aaya NR 
Peng a +y 
37 
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These functions are continuous with their derivatives in (D) and are easily 
shown to satisfy (27). We take the line integral 


—y dz +-ady 


[Pas + ay = | ay 


10) (0) 


and integrate round circle (/,) with centre at the origin and radius a. We get 
by substituting x = a cos p; y = a sin p: 


f —ydax-+ady 


27 
aF yi = fap = an. 
0 


(L) 


The domain (D) has one hole in this case, and the cyclical constant w, = 22. 
The function U(x, y) is the polar angle: 


3 a 


and it increases by 2z on a circuit round the hole. We note that the radius of 
the inner circle can be taken as zero, when we have a point-hole, i.e. we exclude 


the origin (0, 0). By (37), P and Q take the indeterminate form 0/0 at the 
origin. 


73. Independence of a line integral on the path in space. Just as 
in the case of a plane, the condition for a line integral to be indepen- 
dent of the path in space amounts to the vanishing of the integral 
round any closed contour. We take the integral 


A Pde+ Qdy + Rdz. (38) 


We can show as above, on using Stokes’ formula (22), that the 
necessary and sufficient conditions for the independence of integral (38) 
on the path consist of the three identities: 


Tni. (39) 


If these conditions are satisfied, we can form a function U of points 
(x, Y, 2): 


(x, ¥, 2) 
U (x, y. z) = i Pdx+Qdy + Rdz, (40) 


(Xo. Yor Zo) 
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where it can be shown by a method similar to the previous one that 


0U 0U 0U 


a =P: Sy =e g TE (41) 
P dz + Q dy + Rdz = dU (42) 

(B) 
Pdz + Qdy + Rdz = U (B) — U (4). (43) 


(A) 


Furthermore, conditions (39) are necessary and sufficient for the 
expression P dz + Q dy + R dzto be the total differential of a function 
U,, where U, is given by: 

(x,y, 2) 
U,= | Pdx+Qdy+ Rdz+C, 


(Xo, Yos Ze) 


where C is an arbitrary constant. 

Multiply connected domains in space have certain special features. 
We take the example of the domain (D) formed by the interior of a 
sphere, from which two tubular portions (I) and (II) are cut out, with 
their ends on the surface of the sphere, as 
shown in Fig. 70. If we take a closed contour 
(L) round tube (I), a surface contained in 
(D) cannot be shrunk on to it, so that even 
if conditions (39) are satisfied in (D), Stokes’ 
formula cannot be applied to (4), and inte- 
gral (38) round (l) will in general differ from 
zero. The non-zero value of the integral 
will be independent of the shape of (/,), how- 
ever; it gives the cyclical constant œ for 
tube (I) and is the same for any closed con- 
tour in (D) that encircles (I). Similarly, we 
get a second cyclical constant w, for tube (II). The function U(x, y, z) 
given by (40) is many-valued in this case and includes the undefined 
term m; @, + M, @,, where m; and m, are arbitrary integers. 

We notice that if (D) consists of the space between two concentric 
spheres, and conditions (39) are satisfied in (D), there are no cyclical 
constants and (40) is a single-valued function. This is obvious geo- 
metrically from the fact that a surface lying in (D) can be shrunk 
to any closed contour in (D), so that Stokes’ formula (22) can be 
applied to the contour; the vanishing of the integral round the con- 
tour then follows from (39). 


Fie. 70 
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Example. We consider the angle pọ appearing in systems of cylindrical and 
spherical coordinates: 


y 
=arct f 
pọ = are tan -> 


and we use (37) to define P and Q. These latter take the indeterminate form 
0/0 all along the z axis. When we consider the line integral in space 
—yda + x dy 

x + y? 


fe dz + Q dy = f 
@ © Ò 


we have to exclude the tube running along the z axis, the cyclical constant 27 
being obtained as the value of the integral written round any closed contour 
encircling this tube. 


74. Steady-state flow of fluids. Let v(x, y) be the velocity vector of the 
steady flow of an incompressible fluid in a plane, and let u(x, y), v(x, y) be the 
projections of the vector on the axes. We saw in Example 3 [67] that the 
absence of sources implies the vanishing of the integral 


f—vdx + udy (44) 
() 


over any closed contour, which is equivalent to the integral being independent 
of the path; the necessary and sufficient condition for this is, by (27): 


(45) . 


which characterizes an incompressible fluid. When (45) is satisfied, the expres- 
sion 
— v dy + u dy 


is the total differential of some function y(M), given by the relationship 


(B) 
y (B)—y (4) = | — vds + udy. (46) 
(4) 

Function y(M) is called thestream function and has a simple physical signifi- 
cance: the quantity of fluid flowing per unit time past an arbitrary contour start- 
ing at A and finishing at B is given by y(B) — (A). This follows immediately 
from formula (10) for the quantity of fluid [67]. 

If certain points of the domain are sources, we can exclude these points 
and get a domain with holes in which (45) is valid. The cyclical constant for 
a hole is equal to integral (44) over a contour encircling the hole, and evidently 
gives the quantity of fluid q produced by the corresponding source in unit time. 
The function y(M) is now many-valued. If q < 0, the source is negative (a 
sink). 

We consider, in addition to (44), the integral 


f udz + vdy, (47) 
0) 
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the value of which is usually called the speed of circulation along contour (l). 
We take the speed of circulation over a closed contour as zero, i.e. (47) is inde- 
pendent of the path. This is the same as saying that the flow has no vortices. 
With this assumption, a function 9(M) exists: 


(x, y) 
p= f udx+vdy, (48) 
(xo, Yo) 


such that the projections u and v of vector v are its partial derivatives: 


3p, ð 
u = On > v= Oy . (49) 
The function g is called the velocity potential. If we have independence of 
integral (48) on the path in a multiply connected domain (a domain with holes), 
the velocity potential ọ will in general be a many-valued function and the cyclical 
constants of (48) with respect to each hole will give the strength of the correspond- 
ing vortex. 
Equation (46) gives us [71]: 


On comparing these equations with (49), we get two equations relating the 
velocity potential » and the stream function y: 


3p w, dr 5 
Qa Oy’ y Oa’ (50) 


These are the Cauchy- Riemann equations; they are of fundamental importance 
in the theory of functions of a complex variable and their hydrodynamical 
significance, established above, serves as a basis for the wide applications of 
the theory of functions of complex variables to plane problems of hydro- 
dynamics. 

In the case of steady-state motion in space, the velocity vector w(x, y, z) 
has three components: u(x, y, z), u(x, Y, z), w(x, yY, z), whilst we have to consider, 
instead of (48), 

f u dæ + v dy + w dz, 
9) 


the conditions for this to be independent of the path being: 


Ow Ov 0: ou Ow 0 Ov Ou 


Oy oz > 02 On” oe ay P 


if these conditions are fulfilled, there exists the velocity potential 


(x, y, 2) 
g= f u dx + v dy + w dz, 
(Xo, Yos Zo) 
where 
Op ap , ap 
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The generalization of condition (45) for incompressibility to the case of 
space will be found in the next chapter. 


75. Integrating factors. If 


Pdz -+ Qdy (51) 
is not a total differential, i.e. 
oP (4) 
op Oat 0, 


it can be shown that a function u can always be found which yields 
a total differential when multiplied by (51), i.e. 


(Pdr + Qdy) =dU. (52) 


The function u is called an integrating factor of (51). 
By [71], a necessary and sufficient condition for u to be an integra- 
ting factor of (51) is for the equation 


a (uP) 3U) _ 
“oy ee Cy 


to be satisfied; this can be rewritten as 
ay Ou aP 8Q) __ 
ele tele a) = (54) 


which can be looked on as an equation for u. This is in general difficult 
to use, however, since, being a partial differential equation, its 
integration is likely to be more awkward than would be the case 
with an ordinary equation. 

If (51) is a total differential, the equation 


Pdz+@Qdy=0 (55) 


is called an exact differential equation. 
It may be integrated directly. Let U be the function for which 


dU = Pde + Qdy. 


Given our assumptions, which are equivalent to condition (27), 
this function can always be found from (29). Equation (55) is the 
same as dU = 0, i.e. 

U =0, (56) 


this being the general solution of the given equation (55). 
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Now suppose that (51) is not a total differential. By the existence 
theorem [51], (55) will always have a general solution, which may be 
written in the form: 


F(x,y) =C. 
The function F(x, y) must satisfy the relationship 
OF (x, y) OF (a, y) dy _ 
Se by de h 


where dy/dz must be replaced, using (55), by (— P/Q), i.e. we must 
have the identity [7]: 


or OF 
Pr gs 
P Q” 
If u is the common value of these ratios, we have 
OF oF 
Ge MPS Gy = HG 


ie. u is an integrating factor of (51). 

The above shows that every expression P dx + Q dy has an integrat- 
ing factor. 

If we find an integrating factor of (51), and from this the function F, 
the general solution of (55) can be written at once as 


F=C. 
Examples. 1. The differential equation of a stream line in steady state 
plane fluid flow is [52] 


dg d 
u o woe — vdr + udy =0, (57) 
where u and v are the projections of the velocity v on the axes. If the fluid is 


incompressible, we have: 


ðu ow 
tet Gy = 
which shows that 
— v dx + udy (58) 


is a total differential; we saw in [74], in fact, that 
—v dgr + u dy = dy. 
where y is the stream function; the equation of a stream line is 
y=, 
which is in fact the general solution of the equation: 


—vdg + u dy =0. 
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2. We discussed elementary heat processes in Example 4 of [67] and gave 
expressions for the infinitesimal quantity of heat obtained in the process 
as a function of the iafinitesimal changes in pressure p, volume v, and tem- 


perature T. 
We have three expressions for dQ, depending on which pair of p, v, T is taken 
for the independent variables: 


c,dT +¢,dv (T, v— independent variables) 
dQ = Cp dT + Cy dp (T, p— ” ” ) (59) 
| Paps Vav (p, v— s is ). 


The fundamentally important quantities c, and c, are called the specific 
heats of the body at constant volume and constant pressure respectively. 

On expressing one of the independent variables of (59) in terms of the other 
two, we get a series of relationships between the coefficients. 

For instance, let T and v be the independent variables in the equation 

epdT + cgdp = c, dT + c, Ww (60) 
We write: 
_ op op 


and by substituting this for dp in (60) and equating coefficients of dT and dv, 
we get: 


G) 
Cy = Cp + Cg SF (61) 
3 
Cy = Ce E ; (62) 


In the same way, we use the equation: 


c,dT + c, do = P dp + V dv 


to obtain: 
ap 
Cy = P Or Š (63) 
3 
¢=V+P eo ; (64) 


We have the equation of state, in the case of an ideal gas: 


pv = RT, 
whence the relationships follow: 
op R Op  p, %w R ow v T _ č v OF _ p 
oT v’ w wv?’ OP v’ Op p’ Op R? w` R?’ 


whilst equations (61) to (64) give: 


R 
Cy = Cp + Cg —— 5 G = — e 3 C= P ; o =—P 2 4y. (65) 
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These equations enable c,, c,, P, and V to be expressed in terms of the basic 
magnitudes c, and cp: 


v v 
a= (p— e): a=— (pe) 3 Pes V=. (66) 


The expression for dQ is not in general a total differential. But by the two 
basic principles of thermodynamics, it can be stated that: 

I. The difference between dQ and the elementary work of the pressure, 
pdv, is a total differential: 


dQ — p dv = dU, 


where U is called the internal energy. 
II. The result of dividing dQ by the absolute temperature T is a total dif- 
ferential; in other words, 1/7’ is an integrating factor of dQ: 
es 
T A , 
where S is called the entropy. 
Rule I gives us, by the first of expressions (59): 


dU = dQ — p dv = c, dT + (c, — p) dv, 
whence 
acs | _ alcı — p) 
Ov T oT v : 


(67) 


Subscripts T and v refer to the variables that are reckoned constant during 
the differentiations. 
Similarly, rule II gives: 


a 
© 
e 
> 


E hae 
whence 
l 3c =F (4 1 ĉc Cy 
T wlr OT \ TS, T Fh TT?’ 
or 
cy | _ 3c, A 
dlr OT |, T (68) 
We find by comparing (67) and (68) that 
Op 4 
OF a: (69) 
We obtain from this, on returning to the case of an ideal gas: 
Op R c, ee eee 
ap tg me ge (70) 


c, = p = (cp — ey), OT Cy—c,= R. (71) 
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It may be taken from experimental data that: 
III. The specific heat c, of an ideal gas at constant pressure is a constant, 
so that c, = cp — R is also a constant. 


It follows from (71) that cp > ¢,, and if we write 


Cy 
where k > 1, we easily get, on using (66) and (71): 


v k 
Se rt ER pee) EE 


after which, (59) gives the following expressions for dQ, dU, dS: 


cy, dT + pv 
ag = cp dT — v dp (12) 
v dp + kp dv 
k—1 
dU =o, dT (73) 
aT dT d 
dS = o, -7 E a n . (74) 


The temperature remains constant in isothermal processes, ie. dT = 0 
and 
dQ = p dv, 


i.e. all the absorbed heat goes into work done by the pressure and the total 
change in the quantity of absorbed heat on passing from volume v, to v, is 


(v) 
f pa. 
(») 


The graph of a constant temperature process is called an isothermal. 
An adiabatic process is one in which there is neither influx nor loss of heat, 
and it is characterized by the condition: 


dQ=0 or dS=0; S= const., 
or constant entropy. We can find the entropy from (74): 
S = c log T + Rlogv +0, 
so that an adiabatic process is characterized by 
c, log T + Rlog v = const., 
or, in non-logarithmic form: 
Tee YR = Tr vêp te = const., 
or alternatively, on raising to the power 1/c,: 


Trk—-1 = const., 
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whence finally, inasmuch as T = pv/R: 
pif = const. (75) 
Finally, we have at constant volume, when dv = 0: 


dR =c dT; dR = c, (T, — T3), (76) 


if the gas passes from temperature T, to T}. 


76. Exact differential equations in the case of three variables, We 
obtain on generalizing (55) for three variables: 


Pdr + Qdy + Rdz = 0, (77) 


where P,Q and R are given functions of (x, y, z). If conditions (39) 
are satisfied, the left-hand side of (77) is the total differential of a 
function U(x, y, z), and the general solution of (77) becomes: 


U(x,y, 2) =C, (78) 


where C is an arbitrary constant. Geometrically, (78) gives a family 
of surfaces in space. Assuming that the left-hand side of (77) is not 
a total differential, an integrating factor is sought, i.e. a function 
u(x, y, 2) such that the left-hand side of the equation 


u(P da + Qdy + Rdz)=0 (79) 
is in fact a total differential. Conditions (39) now give: 
aR) _ IW) o. O(P) _ _uR) o. 
ay az ? az ae tt” 
uQ) _ uP) o 
dx dy i 


which can be written as: 


(2 — 8) =o Roe. 


oy az dz oy ” 
IQ OP) põue pab, 

bar) Pa oe oe) 
@P ƏR 3u Ou 

(oases Pa 


If we multiply these equations by P,Q, R respectively, add, and 
cancel u, we get a relationship between P,Q, R: 


Pg e)ta ee oe 


j+ a(R -Eo on 
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Having assumed the existence of the integrating factor u, we have 
thus arrived at the necessary condition (81) which must be satisfied 
by P,Q, R. We shall not dwell on the proof that the condition is also 
sufficient, ie. equation (77) does not always possess an integrating 
factor, the necessary and sufficient condition for the existence of such 
a factor being given by (81). If u exists, the left-hand side of (79) is 
the total differential of a function U, and (78) gives the general 
solution of equations (7¢) and (77). On the other hand, if condition 
(81) is not satisfied, (77) does not possess a general solution of the form 
(78). We sometimes refer to (81) as the condition for exact integrability 
of equation (77). 

The geometrical significance of (77) and its general solution (78), 
when the latter exists, is as follows. The functions 


P(x, y,2), Qx,y,2), R(x, y, 2) 
may be looked on as defining the projections on the axes of a vector 
v(x, y, 2) at every point. The system of differential equations 
ae dy o 
P Q R 

defines a family of curves (L) in space, such that at every point of 
these the corresponding vector v is directed along the tangent. The 
stream lines of established flow in [52] played an exactly analogous 
role. Equation (77) is equivalent to the condition for an infinitesimal 
displacement with components dz, dy, dz to be perpendicular to 
vector v, that is, (77) defines a plane element at every point per- 
pendicular to v, or in other words, an element lying in the normal 
plane to the curve (L) which passes through the given point. The 
general solution (78) also gives a family of surfaces whose tangent 
planes at every point are normal to v. The surfaces (78) are thus 
orthogonal to the curves (L). If a family of curves (L) is given which 
fills space, a tangential vector v can be found at every point; if the 
length of the vector is taken as unity, its components are P,Q, R, 
and equation (77) can be obtained. Equation (81) now gives the 
condition for the given family of curves (L) to be orthogonal to 
some family of surfaces. 


77. Change of variables in double integrals. We conclude the present 
section by deducing the formulae given in [57] for the change of 
variables in a double integral. Let the transformations be given: 


x = p(u, v); Y= y(u, v), (82) 
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where we look on (x,y) and (u,v) as the Cartesian coordinates of 
points in a plane. Equations (82) give the transformation of the 
plane such that the point (u,v) becomes the point (x, y). Let (c), 
(o) be domains in the plane with contours (4), (l) respectively. We 
assume that : (1) the functions of (82) are continuous together with 
their first order derivatives in the domain (01) up to (L); (2) equations 
(82) define a one-to-one correspondence between (o) with contour (1,) 
and (ø) with contour (J), i.e. for every point (u,v) of (o,) there is 
a corresponding single point (x, y) of (o), and conversely, for every 
point of (ø) there is one corresponding point of (0,); (3) the Jacobian 
of the functions of (82) with respect to (u, v): 


Diy, y) _ plu, o) Sole v) _ puo) aA v) (83) 
D(u,v) ~ Ou ov 


preserves the same sign he (o). 

We shall say that the correspondence between (0) and (c) is direct, 
if, on making a counter-clockwise circuit round (l), the point (x, y) 
makes a counter-clockwise circuit round (l). The correspondence is 
called reverse in the opposite case, when a circuit round (L) implies 
a circuit in the opposite direction round (1). The area o of the domain 
(o) is given by the integral [68]: 

o= (ady, 
a 
where the integration is carried out counter-clockwise. 
We find on introducing the new variables in accordance with (82): 


= + | ola, v) dylu, v) = + {oz P ar y du + ea 2 dv. (84) 
(4) (L) 

Let the integration round (}) be counter-clockwise. This direction is 
in fact obtained round (l) after the transformation if the correspond- 
ence is direct, so that we must take the (++) sign in (84). If the 
correspondence is reverse, we get the opposite direction after trans- 
formation to that round (l); if we now write the (—) sign in front, 
we can again integrate counter-clockwise. 

We apply Green’s formula (18) to (84), after setting x = u, y =v, 
P = p dfu, Q = p dy/dv. We have: 

3 _ 3P _ Dy) 
ðu Ov D(u,v) ’ 


o= + J-e ie 8 du dv. 
C) 


(85) 


and hence: 
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On applying the mean value theorem [61] to the double integral, 
we have: 


Dip, 
o=+0,| (p, Y) 


Du, v) lg , (86) 


where we take the value of the Jacobian (83) at a point (ta Va) of (0;). 
Since o and o, are positive, a consequence of this last expression is 
that, if the correspondence is direct, the Jacobian (83) is positive, 
whilst it is negative if the correspondence is reverse. 

We now come to the formula for change of variables. Let f(x, y) 
be a continuous function in (ø). Let (c) be divided into sections 
Ti, T3, ---» Tm By (82), there will be a corresponding division of (e) 
into sections Ti, Tọ ..., Tm. We shall use the same symbols tj, ty to 
denote the areas of the sections. We have by (86): 


De, y) 


KN + 
Tk = Tk D(u, v) 


(ue, 7x)” 


where (up, vk) is a point of ti. We have the corresponding point 
Ly = Plur Vx), Yk = YP(Ux, Vx), and we can write: 


: S Dip, Y) | j 
PA Yy) Tk = = Fpl Vr) YP(Uk v,) | ae (aes "Tke 


On passing to the limit, we get the formula for change of variables 
in a double integral: 


IEG y) dx dy = ff flolu, v), plu, v)] | ih 
(0) é) 


du dv, (87) 


which is the same as (13) of [57]. 

We notice a consequence of (86). Let (c,) contract indefinitely to 
the point (u, v). The point (uo, vo) of (o,) now tends to (u, v), whilst 
(oc) contracts indefinitely to the point (x, y). We obtain from (86), 
on passing to the limit: 


D(p, y) 
D(u, v) 


. o 
= lim —, 
9; 


i.e. the limit of the ratio of areas is the absolute value of the Jacobian 
at the corresponding point, as we already mentioned in [57]. Similarly, 
if we look on the function of a single variable x = f(u) as transforming 
points on a straight line, so that the point with abscissa u becomes a 
point with abscissa x, the absolute value of the derivative | /’(u) | 
gives the limit of the ratio of corresponding lengths on the line, 
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or in other words, the coefficient of the linear change produced by 
the transformation at the point of abscissa u. 

We remark that we had to use the second derivative 3?%p/ðu dv and 
its independence of the order of differentiation when obtaining (85). 
We must therefore, strictly speaking, add the further assumption to 
those at the beginning of the section of the existence and continuity 
of 3%] du dv, whence follows its independence of the order of differen- 
tiation, as we know from [I, 155]. 

If (v) is a domain of space, bounded by the surface (S), we can 
apply Ostrogradskii’s formula [63] with P =@Q=0 and R =z to 
express the volume v of the domain as an integral over the surface: 


v= f f z cos (n, Z)d8. 
(S) 
By using this expression for the volume, we can prove the formula 
for change of variables in a triple integral [60] in much the same 
way as above for a double integral. 


§ 8. Improper integrals 
and integrals that depend on a parameter 


78. Integration under the integral sign. When evaluating multiple 
integrals we encountered definite integrals in which the integrands 
and even the limits of integration depend on a variable parameter. 
We now pause to consider these integrals in some detail. 

We take the integral: 


Xs 
I(y)= § f(x, y)da, 
x, 
where a denotes the variable of integration and where the integrand 
also depends on the variable parameter y, on which the limits also 
depend. It is clear that in this case the result of integration I(y) 
will in general be a function of y. Formula (7) of [56]: 


B Xs b Ya 
fIwdy= fay ffeyda= fds Ffendy (1) 
a Xı a Yı 


a 


is known as the formula for integrating a definite integral with respect 
to the parameter under the integral sign. It reduces to a very simple 
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form when the limits 2, and x, are constants a,b, independent 
of y [56]: 


b B 
(Tq) dy= fay f f@eyde= fde (fleydy. (2) 


The integrand f(z, y) is assumed continuous with respect to both 
variables in the domain of integration in all these formulae, and the 
domain is assumed to be finite. We are concerned in other eases 
with improper multiple integrals, which are dealt with later. 


Example. The above method is sometimes used to evaluate definite 
integrals when the indefinite integrals are unknown. We use it to evaluate 
Laplace’s integral: 


= y 
I= f o= de. (3) 
0 


Let (D’) be the quadrant of a circle with radius 
r and centre at the origin lying in the first quadrant 
of the axes; let (D”) be the square bounded by the 
lines x = 0, x =r, y=0, y =r; and finally, lt 0 r 
(D’’’) be the quadrant of the circle with centre at Fie. 71 
the origin and radius r V2 (Fig. 71). Clearly, (D’) is 
part of (D”), and (D”) is part of (D’’’). We take the 
double integral over these domains of the positive function e~*~”, and 
have the obvious inequalities: 


3 


f fer” dx dy < f feo? dx dy < f f e-*-Nda dy. 
(D) (D") (D’””’) 


We introduce polar coordinates x = @ cos p, y = ọ sin p, and obtain [56]: 


f f eo? da dy = 
(D) 


ou, ro] 4 


7 
-e ode = -Z [L ore | 2 et 
dp fo o dọ = zf 7 ê la a (l—e™). 
0 


We have on replacing r by r /2: 
f fewer da dy = > (l—e7*"), 
(D°) 

Integration over the square (D”) gives: 


r r T 
f f e-P-Y dady = f e7% dr. f e” dy = (f e™* dz)?, 
T 0 o : 
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and the inequality written above takes the form: 
áá ~r2 ~x2 T —2r2 
Ele Py (f e~ dx)? <—-(1—e 2?) 
The two extreme terms of the inequality tend to 2/4 as r tends to infinity, 


so that the middle term must tend to the same limit; hence we obtain for 
integral (3): 


a, Va 
J e “dr= -z (4) 
0 
It is easily seen [I, 96] that: 
+œ , co = 
Se dr=2 f eV de= yz. (5) 
—o 0 


If use is made of the improper integral over the whole of the first quadrant 
of the axes, denoted here by (P), the above result follows almost at once. 
We have: 


ff e-*-Nda dy = fe dz. f e7} dy = FP, 
(P) 0 0 


and on introducing polar coordinates: 


z 
2 o 
2 -o2 = -e EA SE. S ee 
I A ededp = fap | ede >| ye 7” 
P ò ò 


whence I = Yx/2, which agrees with the value found above. 


79. Dirichlet’s formula. If we specify x, and x, in (1) as functions of y and 
the interval (a, 8) of variation of y, we thereby define a domain (a) of the xy 
plane. A frequent case in applications is that when the domain becomes an 
isosceles triangle, formed by the three lines (Fig. 72): 


Y=u; L=a; yb. 


Jf we reduce a double integral over this triangle to an iterated integral, in 
one case by integrating first with respect to x, then with respect to y, and in 
the other case by integrating first with respect to y, then with respect to 2, 
we arrive at: 


b y b b 
$ dy $ f(a,y)da= f dz $ f(x, y)dy, (6) 


a a x 


which is known as Dirichlet’s formula. 
Example. Abel’s problem. To find the curve in a vertical plane such that a 
material particle setting out with zero initial velocity from any point M of the curve 
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of height h (Fig. 73) and falling along the curve, reaches its lowest point O in a 
time T which is a given function of the height h: 

T =o (h). 


We tako OY vertically upwards, OX horizontally, and the origin at the 
owest point of the curve; the equation of the curve is sought as: 


x =f (y). 


We write: 


ds = dy V 1 + [F (9)? = u (y) dy; u(y) =Vi+ [F E. (7) 


We know from dynamics that the increment in kinetic energy when the particle 
moves from the initial position M to N is equal] to the work done by gravity, 


Fie. 73 


since the reaction of the curve is perpendicular to the displacement of the 
particle and hence does no work; thus, 


1 ds 
—_— 2 ae — . = 
zm mg(h—y); v a 
or 
l1 ( ds 
3 ( a ) g(h — y) 
— ds 1 — 
a = u(y) dy, 


where the (—) sign is taken because the height y of the particle decreases with 
increasing t. 

The time of descent from M to O corresponds to y varying from h to 0, 
so that: 


h 
l u(y) dy 
hè =T= , 
g(h) 25. A ae (8) 


We thus have to find u(y) from equation (8); this is called an integral equation 
since the unknown u(y) occurs under an integral sign. 
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We multiply both sides of (8) by 1//z—h, and integrate with respect to 
h between the limits 0 and z: 


. g(h) als aes uy) dy dy 
yz— i Yz—h J Vh—y =a 


The iterated integral on the right can be transformed as follows by Dirichlet’s 


fi ula: 
ormula ; 2 ia a 
u(y) dy _ WY dh 
ales y -f3 Sty 


z z (9) 
í dh 
= da —__ 
J au "J TEE 


There is no great difficulty in evaluating the inner integral, if a new variable 
t is introduced in accordance with: 


h= y + t(z—y). 
When A varies from y to z, ¢ varies from 0 to 1, and we have: 


z—h = (z—y)(1— t); h—y= (z—y)t; dh = (z — y) dt, 


whence 
z 1 1 
PEPE Le A f ee. n 
Tr (h—y) J t(1 —t) 1 1y 
ETS eH 
= arc sin(2é— 1) Ne = arc sin 1 — arc sin (— 1) = + — (— +) = 7 
and finally: 
n. p(k) dh 
5 Z futja v=] 
0 
or 
J u(y) dy = Fee), (10) 


0 


where F(z) is a known function of z, given by: 


z 


Fe) = e es dh . 
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We find on differentiating (10) with respect to z: 


dF(z) _ ¥2g d f 9(k)dh 


dep ae ae) eee 
0 
which solves the problem, since, on knowing u(y), x = f(y) may be found without 
difficulty from (7). 
We work out the problem in full for the particular case of the isochronous 
curve, for which the general time of descent is independent of the height h, 
i.e. 


u(2) = (11) 


o(h) = const. = c. 
We have here: 


ge) ee a 
cVy2g 
u z) = = 
( ayz 
From (7), we have the following equation from which x = f(y) can be deter- 
mined: 
_ 2g (dy)? A x _ 2go ) 
(da)? + ay = = E E aye (4 =-5). 
We write: 
y =a(l + cost); dy=—asintdi; A=2a- 

We now find: 


22 
2 


de = dy |/ 24 ~1= ioe . (— asin t) dt = — 2a sin? dt; 


l + cosż 
@ = £,— a(t — sin t), 


where x, is a constant of integration. The reader can easily show that the 
curve is a cycloid, in a slightly different disposition to the cycloid of [I, 79]. 

We show later how the differentiation with respect to z is carried out in the 
general formula (11). 

We notice some points regarding the solution obtained. The integral equation 
(8) was solved on the basis of the assumption that a solution exists. Solution 
(11) ought to be checked, strictly speaking, i.e. (11) substituted for u(z) in 
(8) and the equality of the left and right-hand sides shown. A further point: 
double integral (9) is improper in the sense that the integrand tends to infinity. 
We shall see later that (9) exists, and we can easily show that expression (1) 
for reducing it to an iterated integral, is applicable in this case. 


80. Differentiation under the integral sign. We take an integral 
depending on a parameter y: 


b 
Ily) = f f(x, y) da. (12) 
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We take the limits a and b as at present independent of y. We 
assume that f(x,y) is continuous and has a continuous partial 
derivative Əf(x, y)/ðy in the rectangle: a<u<bs a<y< B. 
We show that with these assumptions a derivative dI(y)/dy exists 
which can be obtained by differentiation with respect to y under the 
integral sign, i.e. 


b b 
d 
I [iey de = [LEY av. (13) 


The increment Al(y) of I(y) is given by: 
Ally) = Hy + 4y) — Ty) = fi [fay + Ay) — fle, yee. (14) 
We get from the formula for finite increments: 
flee, y + Ay) —fla,y) = dy PIED (9 <9 <1). (15) 
We use the uniform continuity of df(x, y)/oy in the rectangle 


mentioned to write 

afz, y +04 afz, 

Sey — OY + yia, y, Ay) (16) 
where n(x, y, Ay) tends uniformly to zero with respect to x and y 
as Ay->0, ie. given any positive e, there exists a ô such that 
| n(x, y, Ay) | < £ provided | Ay | < ô. Hence it follows that 


| f n(x, y, Ay) dæ |< f e dx = e(b — a) (\Ay} < ô), 


and since e is arbitrarily small, we have 


b 
f n(u, y, dy) dz > 0 as Ay —> 0. (17) 


a 


We return to (14). Noticing that Jy is independent of x, and 
using (15) and (16), we can write: 


? afm, y) ° 
Al(y) = Ay f g da + dy f n(x, dy) de. 
a a 


On dividing by Ay and passing to the limit, we get by (17): 


b 
lim 22 =f Of(@, y) dz, 
ayo 2Y oy 
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i.e. we have proved (13). We remark that if the continuity of f(z, y) 
only is assumed, it still follows, from (14) and the fact that [f(x, y + 
+ Ay) — f(x, y)] tends uniformly to zero with respect to x and y as 
Ay—> 0, that Z(y) is a continuous function of y. 

With the same assumptions regarding f(x, y), we now consider 


I (y)= f fle,y)de, (18) 


where the limits of integration belong to the interval (a, b) and are 
assumed continuous functions of y, with derivatives with respect to y. 

Let Az,, Ax, be the respective increments of 2, £, when y receives 
the increment Ay. 


We have: 
Al, (y) = L (y + Ay) — L (y4) = 
Xat AX, Xa 
= f jæ, y+ 4y)dz— f f(x,y) de. (19) 

Xı+4Xı Xi 
On recalling [I, 94] that: 

Xt AX, Xe X+ 4X4 Xt dx 

Te e E 
X,+ 4X, xy Xe Xe 


we can re-arrange (19) as: 


AI, (y) = §" [Fey + 4y) — fle, y) de + 


Xat dx, Xt 4x, 
+ f f(a,yt+ Ay) de— f f(x,y + Ay) de. (20) 


We naturally assume in these arguments that f(x, y) satisfies the 
conditions mentioned above for a < y < £ and for all x belonging 
to the intervals of integration. 

We can write, by the mean value theorem [T, 95]: 


zıt sX 
f f(x,y + Ay) da = Ax, f(a, + 6, Ax, y + 4y) = 
= Ax, [f(x Y) +m] 


Xa+ 4X; 


$ "Næ, y + Ay) de = Ax, f(a, + 0, Atty, y + Ay) = 


= Ax, [f(x2, y) + 12] 
(0 <8, and 6, < 1). 
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If Ay — 0, Ax, dr, 0, and we can say, by the continuity of 
f(z, y), that now n, hna —> 0. 

On substituting these expressions in (20) and using (15) and (16), 
we obtain after division by 4y: 


AI 


— [feu y) +m] aE le. ande 


Passage to the limit gives us, by (17), the following formula for 
differentiating integral (18): 


a i J fea) =f HEN dytjany) Yt ennt. (21) 


This becomes (13) if z, and x, are independent of y. Formula (13) 
is also valid for differentiation of a multiple integral with respect to 
a parameter, provided the domain of integration (B) is independent 
of the parameter. If, for instance, the integrand f(M, t) in a double 
integral over (B) depends on the parameter ¢ as well as on the variable 
point M, we have 


4; f [todo | | Aa. (22) 
© 8) 


It is assumed here that f(M,t) and of(M,i)/dt are continuous 
for M varying in (B) including its contour and for t varying in a 
given interval. 

The interval of integration must be finite in the proofs of (13) 
and (22). We apply (18) for an infinite interval in the examples, 
whilst indicating later the conditions for the validity of the procedure. 

It also follows from the above formulae that, if f(x, y), z(y) and 
z(y) are continuous, integral (18) is likewise a continuous function of y. 


81. Examples. 1. We found in [28] the particular solution of the equation 
dY ty as 
“ae + k?y = f (t), 
satisfying the conditions 


SA =0. (23) 
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It has the form: 


t 
yi (u) sin k (t — u) du. 
0 


This may easily be checked by direct differentiation in accordance with rule 
(21). We have: 


t t 
a) eee — ts) du +- 1 (u) sin k (¢—u) aia enna was 


d? í 
H = — r | F(u) sin k (t —u) du + f (u) cos k (t —u) = = — k?y + f (t). 
0 
i.e. in fact: 
TY 4 by =] (0. 


The expressions above give conditions (23) directly, on setting ¢ = 0. 
2. Let us evaluate the integral [T, 110): 


IC 
qi =| 1+ a? sai 


We introduce a parameter a and consider: 


ige 28D AG a, 
It is immediately clear that i 

I(0)=0 and IF(l=k,. 
Application of (21) in ee to a gives: 


aia) iee log (1 +a) — 


-f aa l + a? 


We get on decomposing the rational fraction into partial fractions: 


Oog To 1 a a x a 
araara oral otete 


and on integrating with respect to z: 


x _ _ log(l +a?) a arc tan a 
z (3 + ax) (1 + 2?) me 2(1 + a?) + lta ° 
Finally: 
dI (a) __ __— log (1 +æ) aarctana | log +a) log (1 -+ a?) aare tana 
da — 2(1+æ) l +a? I+a ~ 20 +a?) 1+ a? 


_ 1 flog(1 +a?) a arc tan a 
I@=5 f rpa tj) aya 


da, (24) 
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where we do not write a constant of integration since I(0) = 0. The second 
term may be integrated by parts: 


a a 
da = [ arctan a d log (1 + a?) = 
0 
a 
and therefore, by (24): 0 
I (a) = i -> arc tan a.log (1 + a?), 


whence, with a = 1: 


i= Jona +2) Pe ee 


J l+ F 
ò 


3. We evaluate the integral: 


œ 


f sin Pe dz. 
0 


We consider instead of this what at first sight appears to be more difficult: 
1(a,p)=[o-™ Fae (a >0). (25) 
0 
We differentiate with respect to £: 


w% œ 


ae B) =| T (a anes) dz =) e cos fx da, 


This last integral is easily evaluated [I, 201]: 


w% 


ƏI( 5 L(a, p) a x cos fæ de eter ZI conte tE sin fæ 5 = 4 7 
whence 
Ia, p) =| PSF +0 arctan +0, (26) 


It only remains to find the constant of integration C, which is independent 
of $. We let f tend to zero in (25) and (26): 


pe p) = I(a,0)=0; I(a,0) = arctan0+C=0, 


whence it is clear that C = 0. We thus have: 


= B 
I(a, 8) = are tan ~— : 
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The integral to be evaluated is obtained from I(a, 8) with a = 0, where a 
must tend to zero through positive values, ie. a— +0. If we let a approach 
zero in the above equation, we get different limits according to whether 
B>0O or B <0: 


jn for f >0 


lim artan ea l 
a>+0 a — yz" for b<0 


0 for B=0 
so that finally: 


> for B>0 


œ 


1(6) = | 


0 


sin Ba dx = 


ehg for B<0 
2 
o for B=0. 


The integral gives us a discontinuous function I($) of 8. The graph of the 
function consists of two half-lines and a point, and is shown in Fig. 74. 


Fie. 74 


4. On differentiating the obvious equation 


œ 


feras- 2 (a > 0) 


a 
0 
{The above arguments are not rigorous, since the equations have been 


assumed: Psst I(a, 8) = I(a, 0); lim I(a, $) = = I(0, £); these can be taken 


as self-evident if I(a, 8) is known to be continuous with respect to both B 
and a, A further point is that, if we had not introduced the factor e~°* under 


the integral sign, the meaningless integral f cos fx dx would have been obtained 
ò 


after differentiation. A rigorous proof of the continuity of I(a, 8) is given 
in [85]. 


254 MULTIPLE AND LINE INTEGRALS [81 


k times with respect to a, we get: 
> k! 
—ax „K C is 
| eo “g" dg = akti 
0 


We now consider the integral: 
I,=for™ a"dx (a>0). 
0 


If n is odd: n = 2k + 1, I, is obtained by substituting x? = t: 


e lf 1 k 
Loki = formate dz = = fe“ t dt = o Ghat 
0 0 


For the case of n even, we introduce a new variable of integration « = Vat 
into (4). After replacing ¢ by x again in the result obtained, we arrive at: 


a ESEA 
I, =fe masc- Z. 
0 


We find on differentiating this & times with respect to a: 


k 
<> =(— e x” dx, 
0 
whence 
z = 1-3...(2k—1) 
Ix = ( 1k 2 (+ 2) = 
Z dak \ 2 a 2 k+ — 
gk. 2 


5. We shall take a as constant in the integral: 
I(p)= [e-a cos Ba dx (a > 0), 
0 
which depends on two parameters a and $. We differentiate with respect to 


B: 


dl we — fe- sinfe- rde= -yy [sin bx de~ 
i ð ò 


We now integrate by parts: 


aI (6)_ 1 ap =P E f a soul ten 
ag ae? sin fol zle cos Ba da = 2a fe cos Ba dz, 
0 ô 
that is, 


ar p 
a a 
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This is a differential equation with separable variables: 


di (f)_ 8B 
T "ea 
whence we obtain on integration: 
zey 
1(p)=Co ™, @7) 


where the constant C is independent of 8. We substitute p = 0: 


oo 


; 1 m 
~—ax2 
10) =fe dz = -5 = ; 
0 


On the other hand, by (27): 
I0) =0, 
so that C = 1/2 /x/a; and finally, on substituting this value for C in (27): 


g 7 [= ft 
—ax? oo % 4a 

Je cos Sæde =- | Z e 

0 


On replacing a by a?, we get a result that will be used later in investigating 
the equation of thermal radiation: 


Sg n F 
fe 2 cos Ba dx = £a e tr, 
0 

82. Improper integrals. We have come across integrals more than 
once in which either the integrands or the limits have tended to 
infinity. We agreed in [I, 97,98] to ascribe a definite meaning to 
such integrals, provided certain conditions were fulfilled. We now 
treat the subject in detail. 

1. The integrand tends to infinity. In the integral: 


b 
Sf(a)de  (b>a) 
a 
let f(x) be continuous for a < æ < b, whilst tending to infinity at 
x = b; or more precisely, let f(x) be unbounded as x tends to b from 


below. We now have by definition [I, 97]: 


f(æ)dz= lim ‘ff (w) de, 


e>+0 a 


ae 


provided the limit on the right exists. The condition for existence 
may be investigated as follows. The necessary and sufficient con- 
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dition for a variable to have a limit is, by Cauchy’s basic test [I, 31], 
that the difference between any two values of the variable, as from 
a certain point of its variation, should be less in absolute value than 
any previously assigned positive number. The difference becomes in 
our case: 


b—s" 


b—e’ b—e” 
fiede- f jade= f fade (<e), 


and we arrive at the following general condition: a necessary and 
sufficient condition for the existence (convergence) of the improper 
integral 


b 
J f(a) ae, 


in which the integrand f(x) tends to infinity for x = b — 0, is that for 
any given small positive ô there exists an n such that 


b—e" 

| f f()dzj<6 for O<e’ and e”<y. 
b—e’ 

If we use the familiar inequality [I, 95]: 
be d b—s" d 
x| < x X; 
| J imde S l 
it follows immediately that the convergence of the integral 


f | f(x) |d (28) 


implies the convergence of 
b 
f f(x) da. (29) 
a 


The converse is not true, i.e. the convergence of integral (29) does 
not imply the convergence of (28). If (28) converges, (29) is said to 
be absolutely convergent [cf. I, 124]. 

A test of Cauchy’s which is very important in applications follows 
from the general test: if the integrand f(x) is continuous for a < x < b 
and, for x near b, satisfies the condition 


Fol gir (30) 
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where A and p are positive constants and p < 1, the improper integral 
(29) is absolutely convergent. 
On the other hand, if 


If (@)|> gap and p>, (31) 


integral (29) does not exist. 
We have, in fact, in case (30): 


b—e" b—e" b—e" a isis 
[Jr (a) de < JI f (x)| dz < 4 | w= = Ar =, 


where the right-hand side becomes as small as desired, with sufficiently 
small ¢’ and e”, since the power (1 — p) is positive (p < 1). 

In case (31), we can say first of all that the continuous function 
f(x) keeps a constant sign in the neighbourhood of x = b; this follows 
because the absolute value of f(z) remains greater than a positive 
number, by (31), so that f(x) cannot vanish, as would be the case 
if it changed sign. If we confine ourselves 
to the case of positive f(z), we have y 


b—s” b—s" 


Í jajäs>a J — 


b—-2’ 


AAA AR ete 


A log 7 for p=1 
= gP) erp) 
— i—p ” 
where the right-hand side can be made as 
large as desired with suitably small e’ ee 
and £”, since by hypothesis, 1 — p < 0. 

The geometrical interpretation of Cauchy’s test is very simple. 
In case (30), as x approaches b, the curve y = f(x) lies wholly inside 
the area included between the two symmetrical curves 


A 
Y = + Gyr (32) 


(Fig. 75); this area is finite for p < 1, so that the area under f(x) 
is also finite. In case (31), the curve y = f(x) passes outside the shaded 


258 MULTIPLE AND LINE INTEGRALS [82 


area, and since this is infinite for curves (32) with p > 1, the area 
under f(x) must also be infinite (Fig. 76). 

Exactly similar arguments apply when f(x) tends to infinity at 
the lower limit z =a, or at some intermediate point x = c of the 
interval of integration [I, 97]. 

2. Infinite limits. We now take the case 
of b = + œ, ie. the improper integral 


+o , b 
J i@)dz= „im J fe) de 


with the assumption that f(x) is contin- 
uous for >a. We get on applying 
YH Cauchy’s test as in the previous case: 
the necessary and sufficient condition for 
the existence (convergence) of the imprap- 
er integral 

+o 

f f(x) da (33) 


a 


is that, for any given small positive 6 there 
exists a positive N such that 


Fig. 76 


b” 
|{ f(x) da] <ð for b' and b >N. 
6 


In particular, we can prove Cauchy’s test just as in the previous 
case: if the integrand f(x) is continuous for x > a and 


lf@)|< and p>1, (34) 


the improper integral (33) is absolutely convergent. 
If, however, 


A 
|f(x)| > and p <1, (35) 


integral (33) does not exist. 
The treatment is similar for the improper integrals [I, 98]: 
b +0 
f f(w)dx and f f(x) da. 

We indicate a convenient practical method for applying Cauchy’s test. 
We start with an integral of type (33). Condition (34) for its convergence 
amounts to the existence of a p > 1 such that f(x) a? remains bounded as 
æ — + œ. This is evidently true if a finite limit exists: 


lim f(a) a?. 
X=» + 0 
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Similarly, condition (35) for divergence is satisfied if the limit exists: 


lim f(a? (p< 1), 


differmg from zero (finite or infinite). Thus, e.g., the integral of example 5 
[81] is absolutely convergent, since e~°** cos f+ a? tends to zero as æ > + œ 
for any positive p. This follows because the absolute value of cos Bx does 
not exceed unity, whilst it is easily shown that 97°% «P > 0 by applying 
PHôpital’s rule with, say, p = 2 [I, 65}. 

The integral: 


f 5x? + 1 
———d 
a3 + 4 
0 
is divergent, since 
: 5a? + ] 
lim ——— 
r->»po «£+4 


The integral of a rational fraction, with one or both limits infinite, is in general 
convergent only when the difference in the degrees of denominator and numera- 
tor is at least two. Furthermore, the denominator must not vanish in the interval 
of integration after any possible cancelling. If the interval is (— œ, +), 
the denominator must have no real zeros. 

Similar use can be made of conditions (30) and (31) for the convergence 
and divergence of integrals in which the integrand tends to infinity. For instance, 


c= 5 (p=1). 


is convergent for m< 2, since (sin x/x”)a™—! =sin x/x tends to unity as æ —> -+ 0, 
and p =m — 1 < 1. On the contrary, the integral is divergent for m > 2. 


83. Conditionally convergent integrals. Cauchy’s test only gives 
sufficient conditions (30) or (84) for the convergence of improper 
integrals. It cannot be applied say to conditionally convergent integrals, 
i.e. those where 


b +e 
f fx)da or f f(a) dx 
is convergent, whilst 


b œ 

Sæde or f |f(x)| dx 

a a 
is not convergent. We now give a test suitable for conditionally 
convergent integrals: if l 


Fa)= fidt (a>0) 
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remains bounded on indefinite increase of x, 
He) 
z 
xP dy 
a 


is ee for any p> 0. We have, on integrating by parts: 


[10a = s=j ha _ F(a) =n 
aP j|x=a 


N 
> F 
T p| 28, dy 


a 


or, since a = 0: 


N N 
F(N ) 
JE ax = FO + p | FO) ae. 


T 
a a 


The first term on the right tends to zero with indefinite increase 
of N, since F(N) is bounded by hypothesis and p > 0. The second 
term represents an integral which is convergent by Cauchy’s test, 
since the numerator F(x) of the integrand is bounded by hypothesis 
as x —> +œ, whilst the degree of x in the denominator is greater 
than unity. Hence the limit exists: 


Examples. 1. We take 


œ 


sin fx 
f LET. dz, (36) 


0 


already investigated in Example 3 [81]. We notice that the integrand takes a 
finite value f for x = 0, so that the integral is only improper on account of 
the infinite limit. It is clear that 


ie px dz = |- + cos Bx Tee ; 
whence 
N 
| fein pede |< $ (B > 9), 


N 
i.e. f sin Bx dx is bounded for any a and N. The theorem proved is therefore 
a 


applicable to (36), and this is convergent. 
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2. We also consider 
f sin (x?) da. (37) 
0 
On changing the variables in accordance with x = Jt, this is reduced to the 


form 


1 sin ¢ 
= dt 
F] 
PE ve 
and we can show itsconvergence as in Example 1. We consider in further detail 
the reasons for the convergence of (37). The graph of the integrand f(x) = sin (x?) 


y 


Fie. 77 


is shown in Fig. 77; since f(x) does not tend to zero as x — + œ, Cauchy’s 
test cannot be applied. We subdivide the interval (0, + œ) as follows: 


(0, ¥x); Va, ¥2x); (Va, V3x);...; Vna, Va F 1a); 


so that f(x) retains an invariable sign in each sub-interval: (+) in the first, 
(—) in the second, (+) in the third, etc. We write: 


Vari | 
Un=(—1" f sin (2) de. 
Ynn 


On replacing x by a new variable t: 


c= Vt + nn, 
we get: 


=o a cab sin £ dt 
Sp ftim s zj Vi+ nn ' 


whence it is clear that u, is positive and decreasing with increasing positive 
integral n. Moreover, it follows from the inequality 


a ta 
peda ija 
" 2 Yna 2 n 
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that u, > 0 as n — + œ. All this implies that the alternating series 
Ug — Uy + U — Uu, +- +(—1)"u,t... (38) 


is convergent [I, 123]. 
We now suppose that 


Yma <b < VY(m+ 1), (39) 
and consider the integral 
b Vz Yin ma 
f sin (a?) dw = f sin (2?) dx + f sin(a?)de7+...4 f sin (#*)dx + 
0 0 Vx Vimi 


b 
+ f sin (a?) da =u, — u, +... + (— 1)" umi +0 (— 1)" um (40) 
Yma 
where 0 < 0 <1, since the last interval (Vmr, b) consists of only part of 
(Vma, Vm 1)x) or is even absent with b = mz. If b— œ and the integral 
m, given by inequality (39), tends to (+0), the existence of the improper 
integra] 


+o b 
f sin (x?)dæ = lim f sin (æ?) dz = u, — u, + ug — u +.. 
0 b>+o 0 


follows from the convergence of series (38) and equation (40). 

The existence of the improper integral in this case is due to the alternation 
of the integrand, and to the fact that the successive areas, above and below 
the x axis, decrease in size and tend to zero on moving away from the origin, 
this last being due to the indefinite compression of the areas and not to the 
fact that their heights tend to zero. 

Integral (36) may be considered in exactly the same way. 

We obtain the following value for integral (37) in Volume ITT: 


+0 +0 a 
f sin (x2) dx = [cos (—2?) dz = -> -4 : 
0 0 


The integrals written are known as Fresnel or diffraction integrals, the latter 
name being due to the role they play in optics. 


84. Uniformly convergent integrals. If the integrand of an improper 
integral depends on a parameter y, the numbers 7 and N mentioned 
in the general tests 1 and 2 of [82] in general also depend on y. 


+It may be useful to revise the theory of uniformly convergent series in 
Volume I before reading the present section. 
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If, as y varies in the interval a < y < $, the numbers n and N in the 
conditions 


T Heydsr 


be 


3 
{ Hay) da 
b 


<6 for O<e’ and # <n (41) 


<ô forb and b >N (42) 


can be chosen independently of y, the improper integrals 


f Heyde, Y Heyde (43) 


a a 


are said to be uniformly convergent with respect to y. 
In particular, the integrals encountered when applying Cauchy’s tests 
are uniformly convergent if the constants A and p are independent of y. 
Every convergent improper integral can be written in the form 
of a convergent series in which each term is an ordinary integral. 
We can use this approach for the above; let us take the first of integrals 
(43). Let 
Ey, Eg, Egye ey Enron (44) 


be a series of positive, decreasing terms which tend to zero; then 
we can write: 


b b~s, be, bey 
f Mayda f Ke y)de+ f +f +...+ 


b— enti 
+ f +. = u(y) +u (y) + uly) +... + Un ly) t... (45) 
b— en 
where 
b— Ent, 
tin(y) = (Fle, y) de. (46) 
b— en 


For the second of integrals (43), we can specify a series of in- 
definitely increasing numbers 


bi ba ba... bpa. (47) 


and obtain: 


bngs 


Y ta yde= (feydet f+ fH + hb. 
An a b by bn 


= Uy (Y) + Uy (Y) + Ue (Y) +.. -Huny +... (48) 
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It follows at once from the definition of uniform convergence of 
integrals and series [I, 143] that, if an improper integral is uniformly 
convergent, the series corresponding to it is also uniformly convergent 
for any choice of numbers (44) or (47). In fact, if we sum say remote 
terms of series (45), we get the integral over an interval close to b 
for which inequality (41) is satisfied. 

Uniformly convergent integrals have similar properties to those of 
uniformly convergent series [I, 146]. We shall state these explicitly 
for the second of integrals (48), but what is said is likewise applicable 
to the first integral. 

(1) If f(x, y) ts continuous for a < x and for y in a finite interval 
a < y < B, and if 


"F fle,y) de (49) 


is uniformly convergent, the integral is a continuous function of y in 


a<y<fB. 
(2) With the same conditions, the formula for integration under the 
integral sign is also valid: 


id bed +o 
f dy f fæy)ds= f de f fæ, y)dy. (50) 


(3) If integral (49) is convergent with continuity of f(x,y) and 
af(x, y) Əy, whilst 


+% 
Of (x, y) 
Ena (51) 
a 
is uniformly convergent, the formula for differentiation under the integral 
sign is valid: 
at, TE) 
®, 
gli (a, y) dz = |I dx. (52) 
a a 
We give the proofs of properties (1) and (3) as examples. The 
terms of series (48): 


bnp 
un (y) = § f(x,y) dx, (53) 
bn 


are continuous by what was proved in [80], and the series is uniformly 
convergent due to the uniform convergence of the integral; hence the 
sum of the series, i.e. integral (49), is also continuous [I, 146]. 
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We prove (3) by noticing the consequence of [80], that integral 
(53) can be differentiated under the integral sign, i.e. 


Ons 
un (y) = | HEM ae, 
bn 

But we have, by the uniform convergence of integral (51), the 

uniformly convergent series: 

$0 

Hew 4 oy WEY) de = Surly). (54) 

a n=0 bn n=0 
Series (48) is thus convergent, whilst the series of derivatives is 
uniformly convergent. Hence it follows [I, 146] that the sum of 
series (54) is the derivative of the sum of series (48), which leads us 
o (52). 

We indicate a simple test for absolute and uniform convergence 
of improper integrals, similar to the corresponding test for series 
[I, 147]. To be explicit, we take the second of integrals (43); the test 
is similar for the first integral. 

Let f(x, y) be continuous as usual fora < æ and a < y < $. If the 
positive function (x) exists, continuous for a < x, such that | f(x, y) | < 

x) for a < xanda < y < f, and if 


+o 
J p (x) dx (55) 


is convergent, integral (49) is absolutely and uniformly convergent 
(with respect to y). By the convergence of (55), for any given ô > 0 
there exists an N such that: 


b 
fo(z)der<ð for b', b >N, 
b’ 


this N being independent of y, since g(x) does not contain y. But we 
have, since | f(z, y) | < g(x): 


b br , 
|fiænal< fifæyld< f pæde<o for v,e >N, 


i.e. the same N, independent of y, is suitable for integral (49) and 
further, for the integral 


fis (æ, y) | de, 


which proves our statement. 
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85. Examples. 1. We consider Example 3 of [81] in more detail: 


ao 


I (a, b) = Je 


0 


oza SP? ae. (56) 


Let a be a fixed positive number for the present, so that (56) depends on the 
parameter $. We notice that (sin Bax)/x remains continuous at x = 0, where it 
takes the value £; hence (56) is an improper integral only on account of the 
infinite limit. For positive x > 1, we have | (sin fx)/z| <1 and therefore 

joe sin Bx [<e 
whilst the integral 


is convergent; hence, by the test proved, (56) is uniformly convergent with 
respect to 8. On differentiating with respect to $ under the integral sign, we 
get 


b 
$ e7% cos Ba da, 
ò 


which is also uniformly convergent, since | e7% cos fx | < e~™. Hence it follows 

that (56) is a continuous function of f and can be differentiated under the integral 

sign. All the arguments of the example mentioned will be justified if it is proved 

that. lm I(a, B) = I(0, £), i.e. (56) with fixed £ is a continuous function of 
a— 

a to the right of zero. We prove the continuity for a > 0. The convergence 

for a = 0 has already been shown above. 

We can take $ > 0 without loss of generality, since B < 0 reduces to this 
case by simply changing the sign of the integral, whilst our statement is obvious 
if B =0. 

We shall proceed as in [83] for Fresnel integrals. We subdivide (0, + œ) 


as follows: 
oF). FB). MPA... 


so that the integrand 


ja) = 07 EE a >0 and £ >0) 


is (+) in the first sub-interval, (—) in the second, and so on. We write 
(n+1) x 


Pan 
un (a) = (= D” fom A ae 


na 


B 
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We replace x by a new variable ¢:¢ = x — nx/B, and get: 
Uy (a) = 


K 

f haw sin bi 

f sata aE _ SE gi, 
B 

0 


whence it is evident that the u,(a) are positive and decrease with increasing n. 
Furthermore, since 


F 
1 l 
TORE (57) 
° p 


it follows that up(a) — 0 as n> + œ. 
We can thus write our integral for a > 0 as the sum of an alternating series: 


+o 


f ores SEPE de = u (a) — u (a) + uz (a) —...+ (— 1)" up (a) +... (58) 


0 


We have for the remainder term of the series, by (57) and the theorem 

of {I, 123]: 

In (a) | < | Un (| < ner , 
where I/(n+-1)—+ 0 as n + + œ independently of a. The series is thus uniformly 
convergent for a > 0, and its sum is therefore continuous [I, 146], inasmuch 
as the terms u,,(a) are continuous by [80]. 

We remark that the uniform convergence of the integral does not follow 
without further argument from the uniform convergence alone of series (58) 
for a > 0. It can be shown that the integral in this particular case is in fact 
uniformly convergent for a > 0. 

We notice that 


w 


e dz 
1 x 


> 


0 


which is equal to 2/2 for 8 > 0, (— 2/2) for <0, and zero for 8 = 0, 
gives a function of $ with a break in continuity at 6 = 0. Hence it follows 
that this integral cannot converge uniformly with respect to ĝ in an interval 
containing f = 0. If we take the interval to the right of zero, the integral, 
equal to 2/2, has a zero derivative with respect to £; the integral cannot 
be differentiated with respect to 8 under the integral sign, however, since this 
yields the integral of cos fa from 0 to œ, which is meaningless. 
2. In Example 4 of [81], we differentiated 


o 


fear ar (a > 0) 
a 
0 
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k times with respect to a under the integral sign. To justify this operation, it 
is sufficient to show that, with positive integral k, 


e7 gk dx 


Semn 8 


is uniformly convergent for every interval c < a < d, where ¢ > 0. Since 
x> 0 in the interval of integration, obviously e7°* < e7% and e`°* ak < 
< e7 * xk, and by the test for uniform convergence proved in [84], it is sufficient 
for us to prove the convergence of 


oo 
fe™ ok da. 
0 


We write f(x) = e-*a* and apply PHôpital’s rule in the ordinary form 
[I, 65] to see that f(x) 2? = e7% a*+2 — 0 as x — + œ; hence, by the test proved 
in [82], the integral written is in fact convergent. 

3. We obtained the solution of Abel’s problem in [79] as: 


Va a f ehar 
MOS de J Yeh ` 


We show how the derivative on the right can be evaluated. We write: 


* oh) dh 
à Yz=h — 


If we were to differentiate with respect to z under the integral sign, we should 
get (z -- h)~3/* in the integrand, which leads to a divergent integral [82], so 
that we must proceed otherwise. We assume the existence of a continuous, 
bounded derivative y’(h) in the neighbourhood of k = 0 with k > 0, and 
integrate I(z) by parts: 


I(z) = 


z z 
p(h)dh og [ SES — 2p (h) yz g "7 
= 2 fama i PO) Bl + 


+2 f g(h) Yz —h dh = 29(+ 0) Yz +2 f g (h) Vz =h dh: 
0 0 


It may be recalled that y(+0) = jim Oth) This is a constant which in general, 
> 


differs from zero, whereas 9(0) = 0 by definition. We obtain on differentiating 
the above expression and using (21) of [80]: 


df od AO, (OM ap A 
A Vh Vz + | = 


0 
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If p(h) is constant, p’(h) = 0, and we arrive at the expression already obtained, 
If (+0) = 0, we have 
z z ; 
een AU B 
dz 3 Vz—h F yz—h 


We pass over the proof of the applicability of (21) of [80] to the improper 
integral I(z). We notice that, if k is replaced by a new variable of integration 
u, where h = zu, an integral with constant limits is obtained for I(z): 


1 
re) =z { 2S 
| Jl-a 
0 
It is easily shown that, with the above assumption of the existence of a 


continuous, bounded derivative p(k) for h > 0, we can differentiate under the 


integral sign: 
1 1 
dI(z) _ l [g (zu) du z f g (zu) udu 
dz “aE J yi-—u +z | ji-u ` 


If we integrate the first term by parts and return to the former variable h, 
we again get (59). 


86. Improper multiple integrals. We shall start by considering 
improper double integrals. Improper integrals can be of two kinds, 
as above: either the integrand, or the domain of integration, is 
unbounded. We begin with the first kind, and let f(M) be continuous 
in a finite domain (ø) excluding the point C, in the vicinity of which 
/() is unbounded. We surround C with a small domain (4), so that 
/(M) is continuous throughout the remaining domain (oe — A), and 

f f AM) de. 


(0—4) 
has a meaning. 


If the integral tends to a definite limit as 4 contracts to C, inde- 
pendently of how the contraction takes place, the limit is called the 
improper integral of {(M) over (o): 


{ { KM) do = lim f S120) do. (60) 
(0) (o—A 


To start with, let f(J£) be positive, or more precisely, non-negative 
in the neighbourhood of O. Let (A’) and (4”) be two small domains, 
where (4”) lies inside (4’). The integral over (e — A”) will now differ 
from the integral over (¢ — A’) by a positive quantity, equal to the 
integral over (4’ — A”) in which f(M) > 0. It follows at once from 
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this that integral (60) increases on indefinite contraction of the (4) 
to C (if each successive (4) is included in the previous (4)); hence, 
the integral either tends to a limit or increases indefinitely. If the 
limit is finite for one method of contracting (4) to C, the same limit 
will be found by any other method of contraction. The existence of a 
limit is characterized by the fact that the integral over any domain 
which excludes C but lies in the neighbourhood of O, where f(M) is 
positive, remains less than a definite positive number (the integral 
here will tend to zero if the neighbourhood contracts to C). If (M) < 0 
near C, we get the above case on taking the minus sign outside the 
integral. We now let f(M) change sign in any small neighbourhood of C. 
We shall consider in this case only absolutely convergent integrals, 
i.e. such that 


J p Wablde (61) 


has a meaning, that is to say, is convergent. The integrand is now 
non-negative, and the above remarks are applicable. It follows in 
particular from these remarks that, if f}i(M), fi( M) are two positive 
functions, whilst f(M) < f({M) and the integral of f({ M) is con- 
vergent, the integral of f(M) is likewise convergent. We now write 
our f(M) as the difference between two positive functions: f(M) = 
= | f( M) | — [| (M) | — fA]. Integral (61) is convergent by hypo- 
thesis, so that the integral of 2 |f(M)| is convergent. We have 
[| f(A) | — f(AD)] = 2 | (M) | when { M) < 0,and = Owhen f(M Y> 0, 
i.e. 0 < |} M)|— M) < 2|f(M)|, so that the integral of this 
positive function is also convergent. It follows that the integral of 
the difference: |f(M) | — [| f(A) | — {(M)] = f(M) is convergent. 
Hence, if integral (61) is convergent, the integral of f(M) is also con- 
vergent. 

We indicate a sufficient condition for the convergence of (61): 
if | f(M)| < AJr? in the neighbourhood of C, where r is the distance 
of the variable point M from C, and A and p are constants and p < 2, 
integral (61) is convergent. From what has been said above, it is 
enough for us to show that (61) is bounded over any domain (o’) 
which excludes C but lies in a circle of centre C, radius 75. We use 
polar coordinates with C as origin, and note the inequality written 
above for | f(Jf) |; we have: 


| {yanido< A[[-prdrdy = A f f -5r dardo. 
o) (2) (o) 
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The domain (o’) must lie inside a ring bounded by the circles r = 7 
and r = Tọ where 7 can be taken as small as desired. The integrand 
is positive, so that we can only increase the result by integrating 
over the whole of the ring, i.e. 


2x te 
1 QnA 
[TILAN |do < A fap fais dr = PAA ge — gv), 
(9) 0 n 
Since 2 — p > 0, this gives us finally for the integral over (o): 
n 217A 
JJ lFan |do < Sore, (62) 


(0’) 
which proves the above statement. 

It may be noted that the integral over (o’) may be as small as 
desired, given a sufficiently small rọ- 

The definition is similar for an improper triple integral over a 
finite domain (v), when {(M) is unbounded near the point C. All the 
above remarks apply, except that the sufficient condition for absolute 
convergence now reads: if | f(M)| < Afr? in the neighbourhood of the 
point C, where r is the distance of the variable point M from C, and A 
and p are constants and p < 8, the integral 


§{SFap dv 


is absolutely convergent. Here, the condition p < 2 is replaced by 
p < 3, since an elementary volume in polar coordinates in space is 
dv = r? sin 6 dr d0 dp (with 7? instead of the r in do = r dr dọ). 

We now take the case when the domain of integration (øe) is un- 
bounded, i.e. extends to infinity in every direction. Let (0,) be a 
finite domain contained in (o), and let it be extended so that any 
point M of (c) eventually lies inside (o,). Assuming that f(M) is 
continuous in (ø), we can form the integral 


fff (MN do. (63) 
(0) 
If the integral tends to a definite limit on indefinite extension of (c), 
the limit is defined as the integral of f(M) over the infinite domain (0): 


f f f(D) do = lim f f f (DL) do. (64) 
(0) (21) 

If (M) > 0 for all sufficiently remote M, integral (63) either has a 

limit or increases indefinitely on extension of (¢,). The first case is 
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characterized by the fact that the integral is bounded over any domain, 
or in fact any finite number of domains, belonging to (c) and lying 
outside a circle with centre at the origin and radius 7, (with this, 
the integral tends to zero if r¿-> œ). Let (o’) denote the set of these 
domains. We notice a consequence of the definition of improper inte- 
gral, that if 

J JF (8 |do, (65) 


is convergent, (64) is also convergent. Integral (64) is now said to be 
absolutely convergent, and we shall only consider integrals of this 
type. The following sufficient condition for convergence may easily 
be proved: if | f{(M)| < Ajr? for all sufficiently remote points M, 
where r is the distance from any fixed point (origin) to the variable 
point M, and A and p are constants and p > 2, integral (64) is con- 
vergent. If we use the inequality written and introduce polar coordinates, 
we have: 

[JI AD [do < Af f drag. 

(o) (’) 
The set (o’) must be contained within a ring bounded by circles of 
radii r = 7), r = R, where R can be as large as desired. Integration 
over the whole of the ring gives: 


R 


27 
[JF da< A fap f 
(0’) 0 To 


Since p — 2 > 0, we have finally for the integral over (o’): 


2rA 1 
[FAD ldo < p a. 


1 
wai dt = os 


m D a 


2nd fı 1 ) 
(y= Rp 


which proves our proposition. The integral over (o’) may be as small 
as desired, with r, sufficiently large. 

An improper triple integral over an infinite domain is similarly 
defined. The condition p > 2 in the last theorem has to be replaced 
by p > 8 for triple integrals. It may also be noted that the above 
remarks about improper double integrals in which f(M) tends to 
infinity are applicable to improper integrals over curved surfaces; 
these reduce to plane surface integrals, as we have seen [63]. 

An absolutely convergent improper integral reduces, as we saw 
above, to the integrals of the non-negative functions | f(J2) | and 
[| M) | — f(M)], whilst it is of no consequence for these integrals 
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how (4) contracts to the point C or how (a,) is extended. It can always 
be assumed that (4) is a circle or a sphere (4,) with centre C and 
whose radius g tends to zero, and that (o,) is a part of (o) consisting 
of a circle (Kp) with centre at the origin, the radius of which increases 
indefinitely. The uniform convergence of improper multiple integrals 
which depend on a parameter may easily be defined with the aid 
of these remarks. For instance, integral (60), when its integrand 
depends on the parameter a, is said to be uniformly convergent with 
respect to a if, for any positive 6, there exists a positive n, independent 
of a, such that 


Sf FN de|<6, 


where (o°) ts any part of (o) contained in the circle (A,). The uniform 
convergence of the other improper integrals is similarly defined. 
In particular, it follows from (62) that the integral is absolutely and 
uniformly convergent if A and p are independent of a. 

The properties and test of [84] apply for the uniform convergence 
of multiple integrals. 

A more difficult problem arises in the case of improper multiple 
integrals where the integrand is unbounded in the neighbourhood of 
a line (l) instead of in the neighbourhood of a point. Here, the line 
has to be excluded with the aid of a domain (4), then (4) is allowed 
to contract to the line. 

If {(M) is assumed positive in the neighbourhood of (l), we can 
say that the integra] over the remaining domain either tends to a 
limit or to infinity, independently of the method of contraction of 
(A) to (1). Definitions similar to the above are obtained for absolutely 
convergent integrals, which are the only type that we take into 
consideration. 


87. Examples. 1. We consider: 


Jj TF eee 


where (g) is the whole plane. We use polar coordinates and integrate over the 
circle (Kp) with centre at the origin and radius F, and get 


H er l = a [ (l Jm 1]. 


If a <1, the right-hand side increases indefinitely on indefinite increase of 
R and the integral is divergent. If a > 1, the right-hand side has a finite limit 
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x/(a — 1), so that the integral is convergent and equal to 2/(a — 1). The con- 
vergence in this second case can be proved by applying the sufficient condition 
stated in the previous article. 
2. We take: 
f y da dy 
int? 


where (a) is the square bounded by « = 0,« = 1, y = 0, y = 1. The integrand 
tends to infinity along the side x = 0. We exclude this side by means of a narrow 
vertical strip, and integrate over the rect- 
angle (o,), bounded by x = e, œ =1, y = 0, 
y=1(e> 0): 


1 1 
[jee fou fE aa 


Cs 


and we have the limit unity as € — 0, ie. 
the integral is convergent and equal to unity. 
Fre. 78 3. The attraction exerted by a mass on a 
particle, located outside or inside the mass 
(Fig. 78). Let the particle, at C(x, y, 2), be 
of unit mass. We divide the attracting body (v) into elementary masses Am 
containing the points M(é, ņ, ¢). We take the total mass 4m as concentrated 
at M and get the approximation for the attraction at C due to Am: 
Am 
T 
where r is the distance CM, and the gravitational constant is taken equal 
unity. Since the attraction is along CM, its projections on the axes are: 
Am -e Am n—y. Am €—z 


rr a a ee ee 


The aie of the total attraction will be given approximately by 


aa Ik 
A 


Ya SSL 1—Y Am; pawe 


pee T 


If u(t, n, ¢) denotes the density of the body at M, we have: 
Am ~ pA, 


and finally, if we decrease each element indefinitely whilst increasing their 


number, we find: 
xo [[Jotatin y= [ftra ze fff 
(v) 


We notice that the coordinates (£, 7, ¢) of the variable point M of (v) are 
the variables of integration here, the density u(é, 7, ¢) being a function of them. 


7 dv. (66) 
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The coordinates (x,y,z) of O appear both directly in the numerator of the 
integrand and indirectly via 


r =V =a} +0 — y} + (0-2), 


and these coordinates act as parameters, X, Y and Z being functions of 
(£, y, 2). 

If C is outside the body, r never vanishes and we have ordinary integrals 
to deal with. If C is inside the body, the integrands in (66) tend to infinity as 
the point of integration M approaches C, so that we are concerned with improper 
integrals. These clearly have a meaning, however, if we take u to be a continuous 
function, since, on writing H, for the upper limit of the function | u |, we have: 


E 


Jok 
ai Z< 4 
r r 


md | < Ho 
|u 75 | | eS (67) 
so that p = 2 in the rule given above and A = pp. 
A meaning may also be attached, all the more, to 


offs, 0s 


@) 


denoting the potential of the mass concerned at the point C. (A more detailed 
treatment of potential will be found below). 
4. We have the obvious expressions 


r 
eed ro) (eae a 
ae =a (+): = * 


so that integrals (66) can be written in the form: 
? ə /1 H ð (1 
z=jf feae rff a) 
() @) 
y ð (1 
a. fea) 


v) 
i.e. these integrals are found by differentiating (68) under the integral sign with 
respect to x, y and z. The differentiation is carried out with respect to the 
coordinates (x,y,z) of points at which the integrand is discontinuous, and 
this case does not come within the scope of the conditions for the theorems 
established above [84], regarding continuity and the possibility of differentia- 
tion under the integral sign. We shall see later [200] that, provided m(é, 7, ¢) 
is continuous, integrals X, Y, Z are continuous functions of (x, y, z) throughout 
space, whilst U is a continuous function with continuous first order partial 
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derivatives, these latter being obtainable by differentiation of (68) under the 
integral sign, i.e. 


ou Ou Ou 


Bo Ox’ ay ’ Oz 


Further differentiation of the potential U with respect to x, y, z under 
the integral sign gives us, since “(£, 7, ¢) is independent of (x, y, z): 


“ear [Je ale ae SIS" ae 
oe =S ffe as Ea (e 


These formulae are valid only when C(z, y, z) lies outside the attracting 
body (v), in which case they contain only proper integrals. If C is inside (v), 
it is easily shown by direct double differentiation that 


(69) 


wa H - oe aht- 


2 Š 3? 2 5 3 
Oa? Lr r r oy r r (70) 
A ae 
at? (r) re? 


and the convergence test of [87] is no longer applicable to (69), ie. with C 
inside (v), the second derivatives of U cannot be obtained by differentiating 
twice under the integral sign. 

We get, on adding equations (70): 


ae See ta E 


Ont Oy? (r) © O22 rs r3 


and therefore, on adding equations (69), which are valid for C outside (v), 
we find the equation: 


gU aU eU 


ant aye Tae TO (71) 


The mass potential U(x, y,z) of a volume satisfies (71) at points C(x, y, z) 
lying outside the volume. We show further on how the equation has to be modified 
if C lies inside the volume. 

5. We take the case of a homogeneous sphere of radius a ( = constant). 
We take the axis OZ along OC, where O is the centre of the sphere (Fig. 79), 
and introduce spherical coordinates (9, 9, 9): 


Qn na 


U = a ee Grane: aydo de. (72) 
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Obviously, 
r? = 0? + 2? — 2oz cos b. (73) 
We take first the integration with respect to 0: 
f sing do 


r 
0 


We take r as variable instead of 6, ọ and pọ being assumed constant. Two 
cases must be distinguished here: if z > 0, as 0 varies from 0 to x with ọ and 


Fie. 79 


p constant, r varies from (z — @) to (z + @); if z < ọ, r varies from (@ — 2) 
to (o + z) (Fig. 80). Also, from (73), with constant ọ and 9: 


sin 6 dé dr 


r dr =z sin 6 d9; ————_— = ——-.. 
r oz 
Hence we have: 
Ze aro a 
r 
x oO (2 >e) 
f sin6d@ | z-e 
` r ~ ) etz 
: [2-2 e<o 
oz e i 
e-z 


We again distinguish two cases, on substituting from here into (72): 
(1) C is outside or on the surface of the sphere: we have a < z, and ọ < z 
throughout the interval (0,a), so that 


2n a %d 
A. do _ 4nay om 
U = u | ap (2E Trag oa. (74) 
0 0 


where m is the total mass of the sphere. 
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(2) C lies inside the sphere (Fig. 80): here we have to divide (0, a) into two 
intervals (0, z) and (z, a), and we get 


27 z 22d a zd 
U =p fap| [2E + [AES | 2a (at — -2 ); (75) 
J 8 e 3 
0 0 z 
with z = a, i.e. the point on the surface of the sphere, (74) and (75) both give 
the same value for U, which proves the continuity of the function U. 
We now calculate the attraction. This must be along OZ by symmetry, so 
that we only have to find: f 
3U 


Z= -5 ` 


We use (74) when C is outside the sphere: 


m 4 
Z=— 3 (76) 
when C is inside, (75) gives 
Z=— -$ nuz. (77) 


Expressions (76) and (77) coincide for z = a, proving the continuity of the 
attraction Z. 

We see from (74), (76), (77) that the potential and attraction of a homogeneous 
sphere at a point outside it are found by concentrating the total mass of the sphere at 
its centre. Further, the attraction at an internal point of the sphere is proportional 
to the distance of the point from the centre of the sphere. 

We chose the axes in a particular way for simplicity, with C lying on OZ, 
with the result that z in the above expressions is the distance of C from the 
centre of the sphere. With any other disposition of the axes (with the origin 
at the centre of the sphere), z must be replaced by Yx? + y? + 22, where (x. y, z) 
are the coordinates of C as usual; (74) and (75) now become: 


m 


U = — 
Ypy te 


(C outside the sphere); 


U = 2p a? — = (x? +y? + =| (Q inside the sphere). 


The first expression for U evidently satisfies (71). If we differentiate the second 
expression twice with respect to x, y, z we get: 


eU eu aU 


Oat + ay? + he 4x (C inside the sphere). (78) 


As we shall see later, this equation is valid for any volume (v) of variable 
density, when C lies inside (v). 
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6. Let the attracting mass be distributed over a surface (S) with surface 
density (MM), which is a function of the position of the variable point M on 
(S). Let C(x, y, z) be the attracted particle of unit mass as above, and r the 
distance |CM |; we get for the potential: 


u=(f (M) as (79) 
is 


and for the projections of the attraction: 
ðU 61 0U 01 
x= Z-=|fuang- (jes v=4- = [Jeon a, (+) as 
(S) (S) 
3U i 01 
Bae = | {mae (G) a8 
(S) 


We generally speak of (79) as the potential of a simple layer. We only take 
into account in this problem the case when C lies outside (S), so that all the 
integrals are proper. Potential (79) now satisfies equation (71), as above. 


§ 9. Supplementary remarks 
on the theory of multiple integrals 


88. Preliminary concepts. We took as our starting point in the treatment 
of multiple integrals the intuitive ideas of area and volume. The present article 
is devoted to the justification of these ideas and to a rigorous discussion of 
the basis of the theory of such integrals. We start by establishing certain con- 
cepts and proving a number of useful theorems regarding sets of points. Our 
treatment is confined to the case of a plane, since all the arguments are easily 
extended to the case of space. 

We take a plane referred to Cartesian axes XY. The e-neighbourhood of the 
point M is defined as the circle with centre M and radius €. We shall consider 
all possible sets of points in the plane, which can consist of either a finite or 
an infinite number of points. Let (P) be a given set of points. The point M 
is called a limit-point of the set (P) if an infinite set of points of (P) 
belongs to any e-neighbourhood of M. The point M may belong to (P), or it 
may not. If all the limit-points of (P) belong to (P), we say that (P) is a closed 
set. M, belonging to (P), is called an interior point of (P) if all the points of 
an e-neighbourhood of M belong to (P). 

For example, let (P) be the set of all points lying inside the square: 0 < 
<a <1,0<y < l. Every point is now an interior point of the set, and is 
also a limit-point. All the points belonging to the boundaries of the square, 
i.e. its sides, will also be limit-points of the set. Since we have not included 
these latter in (P), the set is not closed. 

An open set or domain is defined as a set, all the points of which are interior 
points. A connected domain is defined as an open set (P), such that any two 
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points of (P) can be joined by a line, all the points of which belong to (P). 
With this definition, the interior points of a square form a connected domain, 
whereas the interior points of two separate squares do not. What we have 
referred to above as simply a domain is sometimes called an open domain. 
The boundary of a domain (P) is defined as the set (l) of points M’ with this 
property: the M’ do not themselves belong to (P), but any e-neighbourhood 
of M’ contains points of (P). Since (P) consists of interior points, we can 
say that an infinite set of points of (P) lies in any e-neighbourhood of M’, 
and we can define the contour (l) of a domain as the set of limit-points 
of (P) which do not belong to (P). It may easily be seen that (2) is a closed set: 
we show that, if N is a limit-point of (1), it belongs to (l). There are points M’ 
of (l) in any e-neighbourhood of N, by the definition of a limit-point, whilst 
N cannot belong to (P) since all the points of (P) are interior points. But there 
are points of (P) in any e-neighbourhood of an M’ (by the definition of bound- 
ary), and hence, there are points of (P) in any e-neighbourhood of N, i.e. N 
in fact belongs to (l). If we include in (P) its boundary (1), a closed set (P) is 
obtained, which is occasionally referred to as a closed domain. If there are points 
of (P) in any e-neighbourhood of a point M, whilst M does not belong to (P), 
there must be points of (J) in any e-neighbourhood of M and it follows, since 
(l) is closed, that M belongs to (l) and therefore to (P). If M belongs to (P), 
it will certainly belong to (P). It follows from what has been said that (P) 
must in fact be a closed set. We remark that after inclusion of the points of 
the boundary (l) in a domain (P), the boundary points can become interior 
points of the new domain (P). If, say, (P) is a square with an internal incision, 
the points of the incision are points of (l), but they become interior points of 
(P) after including (2) in (P). 

We now introduce some concepts referring to any set of points (P) in a 
plane, and not merely to a domain. We define the derived set (P’) of a set (P) 
as the set of all limit-points of (P). The proof that every derived set is closed 
follows exactly the same lines as the proof that (l) is closed. Let (P,) be the 
set of all the points of the plane not belonging to (P); it is usually called the 
complement of (P). The boundary (l) of a set (P) is defined as the set of points 
either belonging to (P) and the derived set of (P,) or to (P,) and the derived set 
of (P), i.e. belonging to (P) and (P{) or to (P’) and (P,). This definition amounts 
to the previous one, as regards the boundary of a domain. Another definition 
of boundary can be given, equivalent to the above. We call M an isolated 
point of a set (P) if there exists an e-neighbourhood of M which contains no 
points of (P) apart from M itself. It may easily be seen that the boundary of 
a set (P) consists of the isolated points of (P) and of the limit-points of (P) 
that are not interior points of (P). It can be shown, as above, that (2) is a closed 
set. We shall be dealing in future chiefly with domains. 

It should be noted that all the above is applicable to sets of points on a 
straight line, which can be taken as the axis OX. What we call an e-neighbour- 
hood of the point a = c now becomes the interval (c — €, c + e), i.e. the interval 
of length 2e with its centre at the given point. 


89. Basic theorems regarding sets. A set (P) is said to be bounded if all its 
points lie in a bounded part of the plane. This latter can always be taken as 
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a square with sides parallel to the axes. We can therefore say that (P) is bounded 
if all its points belong to such a square. 

THEOREM l. Every infinite bounded set (P) has at least one limit-point. We 
prove this first for the case when the points of (P) lie on a straight line, say 
on the x axis. Set (P) is infinite by hypothesis, i.e. it contains an infinite num- 
ber of points; also, since (P) is bounded, all its points belong to a finite inter- 
val (a, b). We bisect (a, b). At least one half (a,, 6,) contains an infinite set of 
points of (P). We now bisect (a,,6,). At least one new half (az, 6.) contains 
an infinite set of points of (P), and so on. We get the sequence of intervals 


(a, b,) (Gy, by), (Gy, ba) -> (am Dnb- 


each successive member of which is half the previous one, whilst all the (an, bn) 
contain an infinite set of points of (P). We know [{I, 42] that a,,b, have 
a common limit p. Given any «> 0, the interval (p — £, p + £) contains 
all the (an, bn) as from a certain n, and therefore contains an infinite set of 
points of (P), i. p is a limit-point of (P), which is what we wanted to 
prove. 

We now prove the theorem for a plane. Since (P) is bounded, all its points 
belong to a square a < x < b,c < y < d, which we shall denote symbolically 
as [a, b; c,d}. We divide the square into quarters. At least one quarter [a,, b; 
cı, dı] contains an infinite set of points of (P). We next divide this quarter 
square into quarters, at least one of which contains an infinite set of points 
of (P), say [a;, bz; c2, dz], and so on. We get two sequences of intervals 


(a,b), (ap bi), (az boh- - -> (Am Bnb- -- 
(c,d), (c1, d1) (€p dz). <-> (Cm Anh- -+ 


in each of which each successive member is half the previous one. Hence, 
an and bp have a common limit p, whilst cn, dn have limit g. By construction, 
any -neighbourhood of the point (p, q) contains all the squares [@n, bn; Cn, dn] 
as from a certain n, and therefore contains an infinite set of points of (P), 
i.e. (p, q) is in fact a limit-point of (P). 

We take a sequence of -neighbourhoods of the point (p, q), with e taking a 
decreasing sequence of values ¢,, &, ... which tend to zero. Let M, be a point 
belonging to (P) in the e,-neighbourhood of (p, q). Let M, be a point belong- 
ing to (P) in the «,-neighbourhood of (p, q), differing from M,. Let M, be a 
point of (P) in the e,-neighbourhood, differing from M,, M, and so on. This gives 
us a sequence of points M,, which will tend to the point M(p, q), i.e. the distance 
M,, M tends to zero or, what amounts to the same thing, the coordinates 
(£n, Yn) of the point M, tend respectively to the limits p and q. In other words: 
a sequence of points, tending to a limit, can be chosen from an infinite bounded 
set (P). 

We shall only consider bounded sets in future and shall not specially stipulate 
this. Let (P) and (Q) be two sets, and let us take all the possible distances 
MN between points M of (P) and points N of (Q). We now have set of non- 
negative numbers MN, which must have a strict lower bound 6 [I], 42]. 
The non-negative number 6 is called the distance between sets (P) and (Q). 
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Tueorem I. If (P) and (Q) are two closed sets with no common points, the 
distance ô between them is positive. 

We use reductio ad absurdum. Let 6 = 0. Sets (P) and (Q) have no common 
points, so that there can be no MN = 0. But it follows from the definition of 
strict lower bound that, for any e > 0, there exists an M of (P) and an N of 
(Q) such that MN <e. We can, therefore, choose a sequence of points M, 
of (P) (n = 1, 2, 3, ...) and N, of (Q), such that M, Np — 0. Two cases may 
be imagined as regards the Mp: either there is an infinity of identical points 
among them, or there is not. In the first case, we retain only the pairs Mn Nn 
in which the identical M,, appear (there will be more than one set, if there are 
several such infinite groups of identical points), and we enumerate the pairs 
by means of integers. In the second case, the infinite bounded set of M,, clearly 
has a limit-point M and, in accordance with the above, we can choose a sub- 
sequence of M,, which tends to M. We retain only the pairs M, Nn in which 
members of the subsequence appear and enumerate these by means of integers. 
We carry out a similar process in regard to the Ny. We are now left with pairs 
of M, and Npn such that: (1) M,,N, > 0; (2) Mn tends to M (or coincides with M 
for all n) and N, tends to the point N (or coincides with N for all n). We obtain 
on passing to the limit: MN = 0, i.e. M and N coincide. On the other hand, 
M, as the limit-point of Mn, belonging to (P), is a limit-point of (P) and, since 
(P) is closed, must belong to (P). Similarly, N must belong to (Q). But M and 
N coincide, i.e. (P) and (Q) have a common point, which contradicts the ori- 
ginal assumption of the theorem. The hypothesis that 6 = 0 is therefore false, 
which proves the theorem. 

We carried out the proof for the case when M, and N, do not coincide with 
M and N. If, say, M, coincides with M for all n, whilst N, does not coincide 
with N, we have MN, — 0, where M belongs to (P). We again have MN = 0 
in the limit, and the proof remains as before. The case when all the M,, coincide 
with M and all the N,„ with N clearly contradicts the assumption that (P) 
and (Q) have no common points. 

A repetition of the above proof could be used for the following theorem: 
af (P) and (Q) are closed sets, there is at least one pair of points M of (P) and N 
of (Q) such that MN = 6. 

We introduce one further concept. Let us take all the possible distances 
M’M”, where M’, M” are any two points of a given set (P). The set of non- 
negative numbers M’M” is bounded above, since (P) is bounded, and there- 
fore [I, 42] has a strict upper bound d, which is called the diameter of the set 
(P). If (P) is closed, it can be shown as above that at least one pair of points 
M’, M” of (P) can be found such that M’M’ =d. 

All the above is applicable to three-dimensional space, referred to axes 
XYZ, An e-neighbourhood of a point M must be understood here to mean a 
sphere with centre M and radius e, whilst we must take the cube a < g < b, 
ec<y<d,e<z< f, instead of the squaarrea<xr<be<y<d. 

A square must replace an interval on the z axis. 


90. Interior and exterior areas. We take as the basis for the measure of 
area the statement that the area of a square with sides paratlel to the 
axes is equal to the square of the length of side. We take lines parallel 
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to the axes, dissecting the plane into a mesh of equal squares. A closed domain 
consisting of a finite number of squares of the mesh will be called a domain 
of type (a). 

The area of this domain is defined as the sum of the areas of the constituent 
squares. There is an infinite number of ways of drawing lines parallel to the 
axes that will dissect a domain of type (a) into corresponding squares. We shall 
not stop to prove the simple fact that 
the sum of the areas of these squares is 
always the samo for a given (a) domain. 
Furthermore, if one or more (a) domains 
with no common interior points lie inside 
an (a) domain (4), the sum of the areas 
of these domains is less than the area 
of (A). In future, square will be under- 
stood to mean the square together with 
its boundary. 

Let (P) be a bounded set of points. 
On dissecting the plane with a mesh of 
equal squares, let (S) be the set of all 
the squares of the mesh, all the points 
of which (including also points of their 
boundaries) are interior points of (P). 

We use the same letter S to denote 
the sum of the areas of these squares. 
Clearly, (S) is a domain of type (a) 
(Fig. 81). Further, let (S + S’) be the Fie. 81 
set of squares of the mesh having points 
in common with (P). Let S +S’ de- 
note the sum of the areas of these squares. Evidently, (S + 8’) has the 
same structure as (S), and (S) is part of (S + S’). The latter set contains, 
apart from squares appearing in (S), the set (S’) of squares having points 
in common with (2). 

On taking all the possible meshes of different squares, we gèt an infinite 
set of numbers S. All these numbers will be less than the area of the square, 
inside which the bounded set (P) is situated. The upper bound of the set of S 
is called the interior area of set (P). We denote this by a. Similarly, the set 
of positive numbers (S + S’) has a strict lower bound, which we call the exterior 
area of (P) and denote by A. Finally, let r denote the length of side of a square 
of the mesh. We prove the following basic theorem (ef. I, 115): 

THEOREM. Ifr— 0, S + aand S + S’ + A, ie. as the mesh grows indefinitely 
finer, S tends to the interior area and S + S’ to the exterior area. 

We always have S + S’ > A by the definition of strict lower bound. We have 
to show that, for any given positive « there exists a positive ņ such that S + 
+8’ <A+ecifr < n. From the definition of strict lower bound, a mesh of 
squares exists such that the corresponding sum S + S’, which we denote as 
So + 8’, is less than A + e, ie. S, +S’ < A +e. Let rọ denote the length 
of side of a square of this mesh. The boundary of (S + S^) consists of a finite 
number of lines parallel to the axes. We can border (So + 8’) by squares of 
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side r,/n, where n is a positive integer, so as to obtain an (a) domain S,formed 
by squares of side r,/n which contains (S, + S’) strictly inside itself. If we 
make n sufficiently large, the area S, of domain (S,) will differ by as little as 
required from S, + S’, so that we can write S, < A + e. 

Let (l) be the boundary of (S,) and (l) the boundary of (P). The closed 
sets (l) and (1) have no common points and the distance ô between the sets 
will be positive. If we take r < 6/y2, all the squares of the mesh having points 
in common with (P) will clearly lie inside (S,), and hence, S + 8S’ < S <A +e 
for r > 6/72. We can therefore take the above 7 as equal to 6//2, and we have 
proved that S + S’ — A as r — 0. The proof that S + a as r— 0 is exactly 
similar. 

COROLLARY. (S) is part of (S + S’), and therefore S < S + 8’. We get 
a < Aasr- 0, ie. the interior area is not greater than the exterior area. 

If (P) has no interior points, S = 0 for any mesh of squares and the interior 
area is zero. If there are interior points, an e-neighbourhood of an interior 
point of (P) will contain a square with sides parallel to the axes, all the points 
of which are interior points of (P). We shall have S > 0 for the corresponding 
mesh, so that the interior area of (P) will be greater than zero. 

If the exterior area of (P) is zero, the interior area will certainly likewise 
be zero, and it follows that (P) has no interior points in this case. Sets with 
zero exterior area will be of considerable importance to us later. It follows 
from the above that these are the sets (P) for which the sum of the areas of 
the squares of the (closed) meshes having points in common with (P) tends to 
zero as r — 0. 

It can be shown that closed curves exist which do not cut themselves and 
which have the parametric equations w = y(t), y = y(t), where p(t) and y(t) 
are continuous functions, whilst their exterior areas are greater than zero. These 
curves are the boundaries of connected domains, as may be shown, and interior 
areas are less than exterior areas in such domains. 


91. Measurable sets. A set (P) is said to be measurable if a = A, ie. if ite 
interior and exterior areas are equal. The common value of a and A is called 
the area of set (P). We notice that, if the exterior area is zero (A = 0),a = 0, 
as we saw above, i.e. the set here is measurable and its area is zero. Conversely, 
if a set is measurable and its area is zero, it is obvious that A = 0. 

The necessary and sufficient condition for a set to be measurable is evidently 
that S and S + S’ have the same limit as r — 0, in other words, that S’ + 0 
as r — 0. This means that, given any positive e, there exists a positive 7 such 
that S’ < e ifr <n. 

If the boundary (I) of (P) has zero area, S’ + 0 as r — 0, since all the (closed) 
constituent squares of S’ have points in common with (l). The converse is also 
true, i.e. if S’ — 0 as r— 0, the area of (I) is zero. The proof is as follows. 
Squares of any mesh that have points in common with (J) can only be absent 
from (S’) in the case when their only common points with (1) are on their bound- 
aries, since if points of (l) lay inside a square, points of (P) would also lie inside, 
i.e. the square would belong to (S’). Squares of this sort — having points in 
common with (l) only on their boundaries, and neither containing points of 
(P) nor belonging to (S’) — can in fact exist (there is a finite number of such 
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squares), though there must be points of (P) in at least one of the eight squares 
that border such a square. If we add to all the squares that belong to (8°) all 
the squares that border on them, we must obtain all the squares without excep- 
tion having points ia common with (l). The area of all the squares thus obtained 
is not greater than 9S’, and, since S’ + 0 by hypothesis, as r — 0, we can say 
that the exterior area of (l) is zero, i.e. simply, the area of (l) is zero. We get the 
following important theorem from the above. 

THEOREM. A necessary and sufficient condition for a set (P) to be measurable 
is that the area of its boundary (l) ie zero, i.e. it 1s necessary and sufficient for the 
sum of the areas of the squares of the mesh that have points in common with (l) 
to tend to zero as r — 0. 

Remark. This theorem can be re-stated as follows by using the theorems of 
[90]: the necessary and sufficient condition for (P) to be measurable is that, for 
any given positive £, there exists a mesh of squares such that the sum of areas of 
squares of the mesh with points in common with (l) is lesa than e. 

If the exterior area A = 0 for a given set, we have all the more A = 0 for 
part of the set, i.e. any part of a set with zero area also has zero area. 

We note a further property of sets of zero area. By using the above method 
of surrounding a square with its neighbouring squares, it can easily be shown 
that, for any given positive €, a set with zero area can strictly enclose an (a) 
domain of area less than e. 

We now consider domains, instead of sets in general. 

Let a measurable domain (P) be divided into two parts (P,) and (P,) with 
the aid of a set (of lines) (A), the exterior area of which is zero. This means 
that interior points of (P,) and (P,) are interior points of (P), not belonging 
to (A). 

It follows from the above that (P,) and (P,) are measurable, whilst the sum of 
their areas is equal to the area of (P). The same is true when (P) is divided into 
any finite number of domains. Conversely, if we combine into one set any finite 
number of measurable closed domains (or sets) (P) with no common interior 
points, the new set (P) is measurable, and its area is equal to the sum of the 
areas of the original sets. Points on the boundaries of the (P) may become 
interior points of the total set. If a measurable domain (set) (Q,) is part of a 
measurable domain (set) (Q.), i.e. if every point of (Q,) belong to (Q,), the area 
of (Q,) cannot exceed that of (Q,). All this follows from the above definitions 
and theorems. 

We shall in future take it for granted that the sub-division of a measurable 
set is carried out with the aid of a set of points with zero exterior area. 

A simple example may be given of a curve (A) with zero exterior area, i.e. 
such that the sum of the areas of squares of the mesh having points in common 
with (A) tends to zero along with r; at the same time, we take (A) to have the 
explicit equation y = (x), where x varies in a finite interval (a, b) and g(x) 
is continuous in this interval. By uniform continuity, for any given positive 
e there exists a ô such that | g(x”) — p(x’) |< e/3(b—a) if x’, a” belong to 
(a, b) and | x” — x’ | < ô [I, 43]. We choose r less than 6 and less than e/3(b — a). 
We construct the mesh by dissecting (a,b) with the points a = x, <a, < 
LTr <<... < Lnr < 2%, =b, where the lengths (x, — kı) are all equal to 
r except for the extreme dissections, (x, — a) and (b — x,_,) being possibly less 
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than r (Fig. 82). We consider the squares of the mesh lying in the band between 
£ = Gk aNd x = xz. Since Lk — Lg < 6, it can be asserted that the difference 
wp between the greatest and least values of g(x) in the interval (x,_,, xy) (i.e. 
the oscillation of v(x) in the interval) is less than ¢/3(b — a). The square with a 
point in common with the lowest point of y= g(x) can go at the most a distance r 
lower (the length of side of the square), whilst the square with a point in common 
with the highest point of the curve can go higher by r at the most. The sum of 
the heights of the squares that have points in common with (A) and that lie in 
the band x = w,_,,% = xx, is thus less than 
e/3(b — a) + 2r, or less than ¢/(b — a), since 
r < ¢/3(b — a), whilst the sum of their areas 
is less than e(2, — x,_,)/(b — a). We see, 
on summing from k = 1 tok = n, that the 
sum of the areas of the squares with points 
in common with (A) is less than e, and it 
follows, since s is arbitrary, that the exterior 
area of (A) is zero. The proof is the same, 
that a curve with explicit equation x = p(y), 
where y(y) is continuous in a finite interval, 
also has zero exterior area. A simple curve 
AXX... “Xp b is defined as one which has an equation 
y = p(x) or x = p(y), where (zx) or y(y) is 
Fra. 82 continuous in the corresponding finite inter- 
val of variation of the independent variable. 
It follows from the above that the exterior area of a simple curve is zero. 
Hence follows a sufficient test as to whether a domain is measurable. 

THEOREM. If the boundary of a domain (or set) (P) is a simple curve, (P) 
is measurable. 

It follows from the above discussion that, if we divide a measurable domain 
into a finite number of domains by means of a simple curve (or, what amounts 
to the same thing, a finite number of simple curves), each new domain is also 
measurable, and the sum of the areas of these is equal to the area of the original 
domain. It must be remembered that what has been said is applicable to the 
division of any measurable set into a finite number of measurable parts. 
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It is easily shown that the definite integral f p(x) dx gives the area, in the 
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above sense of the word, of the domain bounded by the curve y = p(x), the x 
axis, and the straight lines x = a, x = b, it being assumed that g(x) > 0. 


92. Independence on the choice of axes. The definition of (interior and 
exterior) area is closely bound up with the choice of axes, inasmuch as our 
measurements were made with a mesh of squares with sides parallel to the axes. 
Parallel displacement of the axes evidently plays no part in the measurements, 
but rotation of the axes about the origin makes a fundamental difference, since 
(P) now has to be covered with a different mesh of squares. Instead of turning 
the axes counter-clockwise by an angle gy, we can keep them fixed and rotate (P) 
by an angle (— g) about the origin. It is evident from this that the independence 
of the area on the choice of axes amounts to the area being unchanged on 
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displacement of (P) as a whole over the plane. The case of parallel displacements 
is obvious, but that of rotation about the origin requires proof. 

We can prove the following theorem, analogous to that of [90]. 

THEOREM I. Let the complete plane be divided into measurable domains (4,) 
with diameters not greater than a number d and such that any bounded part of 
the plane has points in common with only a finite number of these domains. Let 
Z denote the sum of the areas of those domains, every point of which, including 
the boundary points, is an interior point of (P); also, let E + X’ denote the sum 
of the areas of domains with at least one point in common with (P). Then if d — 0, 
X tends to the interior area of (P), whilst E + X” tends to the exterior area of (P). 

The implication of this theorem is that, in evaluating the exterior and interior 
areas, we can use any mesh of measurable domains, provided d — 0, instead 
of a mesh of squares with sides parallel to the axes. 

We note that the boundary of a square, however disposed relatively to the 
axes, is a simple curve, and any square is therefore a measurable domain. The 
above theorem, which we shall only use in the case when the squares of our 
mesh are equal, has the direct consequence that the area of a domain may be 
measured by using say a mesh of squares with sides not parallel to the axes, 
provided the side tends to zero. But here we need to know the value of the 
area, of a square with sides not parallel to the axes; that this area is equal to 
the square of the length of side is still not explicit, strictly speaking, since our 
basic theorem on the measurement of area assumed a square with sides parallel 
to the axes. If we can prove that the area of any square is equal to the square 
of the side, we can assert, by the above, that measurability and area are both 
independent of the choice of direction of the axes, and that an area is unaffected 
by a rotation. 

All this leads us to proving Theorem II. 

THEOREM II. The area of a square with sides parallel to the axes is unchanged 
on rotation about the origin. We first of all recall that the boundary of any square 
is a simple curve, so that any square is a measurable domain. Let (q) be the 
initial square, with side r, and let (q,) be the square obtained by rotation. 
We use the letters without brackets to denote the corresponding areas and let 
qı/q = 8. The ratio q,/q will be the same for a givenrotation of the axes for all 
squares of side r, since we can make any parallel square of side r coincide 
with (q) by a parallel shift of the axes, which does not affect area. We now 
carry out atransformation of similitude on the plane, with centre at the origin, 
in which the length of any radius vector from the origin is multiplied by some 
positive number k. The transformation amounts to shifting a point with 
coordinates (x, y) to the point (kx, ky) [3]. All dimensions of length will be 
multiplied by k. Every square with sides parallel to the axes becomes a similar 
square with k times the length of side. It follows that an (interior or exterior) 
area is multiplied by k?. Let (q¢’), (qi) denote the squares obtained by the trans- 
formation of similitude from (q), (q,) respectively. Obviously, (q1) is found from 
(q’) by the same rotation as that giving (q,) from (q). But qi = k? q and q’ = k? 
q, so that q,/9’ = 8. Now we can transform the square (q) into a square with 
any length of side by a suitable choice of k. Hence we see that the ratio 
q/q = 8 is the same for all initial squares (q) for a given rotation of the plane. 
We now prove that s = 1. We take the circle x? + y? < l with centre at the 
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origin and unit radius, covered with a mesh of squares with sides parallel to 
the axes. This circle is clearly a measurable domain. 

The area of a square is multiplied by 8 on rotation about the origin, and by 
the definition of area and the theorem proved above, the area of the circle 
must also be multiplied by s. But the circle is unaffected by rotation, so that 
its area must be unchanged, i.e. 8 = 1, which proves our theorem. 


93. The case of any number of dimensions. The entire theory of areas can 
be carried over to the case of three-dimensional space, giving us interior aud 
exterior volumes and measurable three-dimensional domains or sets. Cubes 
perform the role of squares. 

We can construct an analogous theory of the measurement of “area”, or 
theory of measure, for any n-dimensional space. A point becomes the set of 
n real numbers (%,%,,.--,%,), arranged in a definite order. The distance 
between two points (2, £z ...,%p,) and (y,, Yz ---, Yn) is given by the expres- 


sion: 
n 
1 
r= |] > (Ys— T) 
s=1 


A sphere with centre (a, Gz, ...,@,) and radius ọ is defined as the system 
of points (x,, Xz, ...,%,) whose coordinates satisfy the inequality 


n 
Z (Bs — as)? < o. 
s=1 


Finally, a cube with edge r is defined as the system of points whose coordinates 
satisfy the inequalities a, < 2, < b; (3 = 1l, 2, ..., 2), where b, — a = r. We 
call r” the m sure of the cube. These definitions together enable us to re-state 
all the abovet eory for n-dimensional space and establish the concepts of interior 
and exterior :easure of'a domain in general, as also to speak of a domain 
(or set) being measurable. All the theorems proved are valid for n-dimensional 
space. A parvilel shift is now given by 2’, = 2, + a, (s = 1, 2, ..., n), whilst 
rotation about the origin is expressed by a linear transformation, in which 
the distances of points from the origin remain unchanged. A more detailed 
treatment of these transformations will be found in Volume ITI. 

We used the idea of a step-line, i.e. a line composed of a finite number of 
pieces of straight lines, in defining a connected domain. A straight line in n- 
dimensional space is defined as a line (i.e. a set of points) having the parametric 
equations x, = p(t), where the p(t) are first degree polynomials. A domain 
in n-dimensional space is exemplified by the set of interior points of a sphere 
or a cube. Such a domain is usually defined by certain inequalities which the 
coordinates of its points must satisfy. We remark that, with n = 1, ie. on 
a straight line, a connected domain must mean the set of interior points of 
a certain interval, What we have said about simple curves is easily generalized 
for n dimensions. In particular, if z = g(x, y) is the explicit equation of a given 
surface in three-dimensional space, where p(x, y) is continuous in some bounded 
closed domain of the XY plane, the surface is a measurable set and its measure 
is zero. The further concept of a simple surface is easily built up after the 
manner of [91], and any domain bounded by a simple surface will be measurable. 
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94. Darboux’s theorem. Having established the concept of area, we turn 
to the theory of double integrals. All our discussion will equally apply to triple 
integrals, and will be arranged along the same lines as used for ordinary integrals 
in Volume I; detailed arguments will be omitted when precisely analogous 
to those of Volume I. 

Let (g) be a measurable domain in the plane and let f(N) be a bounded 
function, defined at all points of the closed domain (a). We divide (ø) into a 
finite number of measurable domains (op) (k = 1,2, ...,”) and let o and 
a, denote as usual the corresponding areas, so that o = 0, + o,-+ ... + On. 
We take the diameter of any (ap) to be less than some number d and let Np 
be any point belonging to the closed domain (0%). 

We form the sum of products: 


> IN oe (1) 
kal 


The type of function f (N) for which this sum has a definite limit as d + 0 will 
be discussed later. Let M, and m, be the strict upper and lower bounds of 
Jf (N) in the closed (ap). As well as the sum (1), we form: 


n 
S = 2 Myk Ok (2) 
k=1 
n 
8 = > Mk Ok (3) 
k=1 
We have, as in [I, 115]: 
n 
8 < X INDE <S, (4) 
k=1 


and we can say that, for any dissection of (a), S and æ lie between the bounds 
mo and Mo, where M and m are the strict upper and lower bounds of f (N) 
in the closed (g). 

We now consider S in more detail, and assume that f(N) is always positive. 
Let three measurable domains (of), (of), (of) with no common interior 
points lie inside (ox), and let the strict upper bounds of f(N) in the new domains 
be respectively MP, M2, MP). Since MP, MP), ME) < Mp, of) + of) + 
+ a) < ox, and all f(N) are positive, we can write: 


MY oD) + MP oD + MP cB < M, op (5) 


Let L be the strict lower bound of all possible values of S. We prove that 
S— L as d — 0. It is sufficient to show that, for any positive e, there exists 
a positive 7 such that S < L + € provided d < 7. By definition of L, there 
exists a fully defined rule (I) for dissection of (o) into (a;) such that the value 
of S thus obtained, say S’, is less than L + e/2. Let (A,) be the closed set of 
points consisting of points of the boundary of (o) and of the boundaries of all 
the (o,). The area of (A,) is zero by the definition of measurability, and we 
can enclose (4,) strictly inside a finite number of squares, the sum of the areas 
of which is less that ¢/2M. Let (1,) be the boundary of the closed domain (Q,) 
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formed by these squares. Let 6 be the positive distance between the closed 
sets (A,) and (la), which have no common points. We show that it is sufficient 
to take 1 = 6/2. Let us take d< 6/2 for the dissection of (a). We divide the 
sub-domains obtained into two classes: those that have no common points with 
(åa) go into the first class, and the remainder go into the second. Let us call 
sub-domains of the first class (g,), and of the second class (tm). The sum S is 
now divided into two sums: S = S, + S,, where 
S= Y U S= Smt 

and u; and vm are the strict upper bounds of f(N) in the closed (o,) and (tm) 
respectively. 

Every (0;) lies inside a certain (0;) of the first rule (I) for dissection of (ø), 
and by (5), the sum of the terms of S, for which (a;) lies inside (0;) is not greater 
than Mk dp 80 that S, < S’, ie. since S’ < L + e/2, we have S, <L + ¢/2. 
As regards Sz, the (Tm) have points in common with (4,) and their diameters 
are less than 6/2; hence, they all lie inside the squares forming (Q,). It follows 
that 27,, cannot exceed the sum of the areas of these squares, i.e. Etm < ¢/2M. 
We have vm < M, so that 


€ e 
Sı= Zna < M gg = 


The required inequality S < L + e for d <7 follows from S, < L + e/2 
and S, <e«/2. Our assertion that S— L is proved for positive functions. 
We seo that it is true for any bounded function by repeating the argument 
of [I, 115]. Similarly, we can show that s— l as d — 0, where J is the strict 
upper bound of s. This gives us the following theorem. 

DARBOUX’S THEOREM. On indefinite decrease of the greatest of the diameters of 
the sub-domains (0;), the sums s and S tend to definite limits l and L, where l< L. 

The whole of the above discussion is applicable word for word to the case 
when. (g) and (Gg) are any measurable sets. Darboux’s theorem remains valid. 


95. Integrable functions. We call f(N) an integrable function over (ø) if 
the sum 


P ANg) ox (6) 


has a definite limit as the greatest of the diameters d of the sub-domains (sub- 
sets) (op) tends to zero. The limit is called the double integral of f(N) over the 


domain (set) (a): 


Sf AN) do = lim X AN, or 
(2) k=1 

It can be shown, as in [I, 116], that the necessary and sufficienti condition for 

S(N) to be integrable is that the limits l and L of the sums s and S coincide, i.e. 

that the difference between the sums: 


n 
© (Mk — My) Op (7) 


tends to zero as d — 0. The common limit of s and S is the value of the integral. 
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If f(N) = 1, (6) is always equal to the area o of the domain (set) (ø), i.e. 


f§do=o. 


(2) 


We also note that, by Darboux’s theorem, the above condition for integrability 
can be re-stated as: for any given positive e there exists a dissection of (0) into 
sub-domains such that expression (7) is less than e. Use of the condition for inte- 
grability reveals certain classes of integrable functions. 

1. If f(N) is continuous in a closed domain (set) (0), it is integrable. The proof 
is the same as in [I, 116]. 

2. Now let f(N) have points of discontinuity, whilst being bounded as before, 
in the closed domain (a). If we assume that the area of the set (R,) of points 
of discontinuity is zero, we can show that f(N) is integrable. 

We first let (a) be a square with sides parallel to the axes and we let it be 
dissected into squares; the case of any measurable domain will be considered 
in the next section. Let £ be a given positive number; (a) can be divided into 
equal squares in such a way that every point of (R,) lies strictly inside an 
(a) type domain, the area of which is less than e/24, where u = M — m [cf. 
91]. We have uniform continuity of f(N) in the remaining closed (a) domain, 
since this contains no points of (R,). Let these (a) domains be (E) and (F). 
Since f(N) is uniformly continuous in the closed domain (F), this can be dis- 
sected into as small sub-domains as required, such that the sum of the terms of 
(7) which relate to these is less than «/2. On taking into account that My 
—m< M — m= u, and that the area of (E) < ¢/2u, we can see that the 
terms of (7) relating to (E), which cannot be sub-divided, is less than ¢/2; 
hence, the whole of (7) is less than e, so that f(N) is integrable. 

It follows that, if the set of points of discontinuity of a bounded function f(N) 
has zero area, f(N) is integrable. This condition is certainly satisfied if f(N) has 
a finite number of points of discontinuity or if the points of discontinuity lie on 
a finite number of simple curves. 


96. Properties of integrable functions. We briefly indicate the properties of 
integrable functions, corresponding to [I, 117] for single integrals. 

I. If f(N) is integrable in a measurable domain (a) and we change the value of 
S(N) at a set of points (Ry) of zero area whilst preserving the boundedness of the 
function, the new function is also integrable and the value of the integral remains 
unchanged. 

The proof is much the same as at the end of the previous article when (0) 
is a square with sides parallel to the axes. The (a) domains (E) and (F) are 
distinguished as before. The value of f(N) is unchanged in (F), and since f(N) 
is integrable, the terms of (7) relating to (F) have a sum less than ¢/2 for a 
sufficiently fine dissection of (F). The terms relating to (E) are less than ¢/2 
by the smallness of the area of (E), and hence follows the integrability of the 
new function, as above. Further, the set (R,) has no interior points, and there 
is a point N, in each of the (o) acquired by dissection of (o) at which the 
value of f(N) is unchanged. We see by using these points in forming the sum (6) 
that the value of the integral is unchanged. 
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II. If f(N) is integrable in a measurable domain (a) and the domain is divided 
into a finite number of measurable domains (0,), (02), .»+) (On), f(N) will be integrable 
in each of the (o,) and the integral over (0) is equal to the sum of the integrals over 
the (Ox). 

We dissect (a) by subdividing the (o). The sum (7), consisting of non- 
negative terms, will tend to zero by the integrability of f(N). The sum of the 
terms relating to each particular (0,) will all the more tend to zero, i.e. f(N) 
is integrable over (op). The second part of the statement follows directly, if 
passage to the limit in sum (6) is carried out for each summation relating to 
each particular (o). The converse is obviously also true: integrability over the 
(Ox) implies integrability over (a). The other properties of integrals given in 
[I, 117] also remain valid; these relate to the removal of a constant factor 
outside the integral sign, to the integrability of the sum, product and quotient 
of integrable functions, as also of the absolute value of an integrable function. 
The proof of the mean value theorem follows the same lines as in [I, 95}. 

We next turn to the proof of the second condition for integrability [95] 
and of the first property of the present section for the case of any measurable 
domain (a). Let the bounded function {(N) be given in this domain and on its 
boundary, the set (R,) of points of discontinuity of the function having zero 
area. We draw a square (Q) with sides parallel to the axes and with (0o) lying 
strictly inside, and we take a new function f,(N), defined as follows in (Q): 
fi(N) = f(N) in the closed (0) and f,(N) = 0 elsewhere. The points of (Q) belong- 
ing neither to its boundary nor to the closed (a) can easily be seen to form an 
open set (domain), say (g,). The boundary points of (o,) belong to the boundary 
(l) of (a) and to the boundary of (Q). Since (g) is measurable, the area of (l) 
is zero and the same can be said of the area of the boundary of (Q). 

Hence the boundary of (¢,) also has zero area, i.e. (g,) is measurable. The 
points of discontinuity of f,(N) in (Q) are the points of (R,) and also, possibly, 
points on (l). The area of the set of points of discontinuity of f,(N) in (Q) is 
zero in either case, so that f,(N) is integrable in the square (Q) [95], in other 
words, it is integrable over (a) and (a,). 

But f,(N) = f(N) in the closed (ø), so that f(N) is integrable over (ø), which 
it was required to prove. The proof of the first property of integrable functions, 
carried out above for a square, may be carried out on similar lines for f(N) 
and any measurable (ø). 

We return to f,(N), which is zero at interior points of (¢,), so that its integral 
over (o,) is zero, whence: 


SIAN) do = $ f A(N) do. 

(0) Q 
It may be remarked that, since the boundary (l) of the measurable (a) has zero 
area, the value of the bounded f(N) on (l) does not affect the value of the 
integral. 


97. Evaluation of double integrals. We now establish the formula for reducing 
a double integral to two quadratures. We first take the case of a rectangle (R) 


with sides: 
=a; ©=b yo; y=d, (8) 
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parallel to the axes. We let f(N) = f(x, y) be integrable over (R), i.e. there 
exists 


SI EN) do = f $ Kæ, y) de dy. (9) 
(R) (R) 
We further suppose the existence, for every x of the interval (a, b), of 
d 
= f æ, y)dy, (a <s <b) (10) 
c 
and of the iterated integral: 
b b d 
§ F(a) da = f [ f f(x,y) dy] dz. (11) 
a a c 


We dissect (R) by sub-dividing the intervals with the points 
a= Ty <2 L Eg Lee L Eg- C Cp = b 
C= Yo < Ya < Ya <.< Ym- < Ym = d 


and we let (Ri) denote the sub-rectangle of the dissection, bounded by: x = x; 
8 = Lipu Y = Yk Y = Yk: Also, let m, and M,, denote the strict lower and 
upper bounds of f(x, y) in the closed (Ri), and let Ax; = x4, — 2p Ay, = 
= Yk} — Yr- On integrating the inequality 


Mig < H(z,y) < My ((@,y) from (Ry)] 


over the interval Yk < Y < Yk we get: 
Ytp 
Mir Ayk < $ Key) dy < Mi Ayy (Ti <2 < 2441), 
Yk 
where (Yk, Yx4;) is part of (c, d) and the integral exists since (10) exists [I, 117]. 
We obtain on adding these inequalities: 


m-1 m—I 
2, Mik Ayn < i f(x,y) dy < P Mik Ayg. 


We integrate over the interval (£; 2;4,): 


m-l 
È My, Ay, Atı < T fiey Waylaz< Ý Mik Ay, A233 


= x c 


the integral just written exists, since (11) exists. We sum these latter inequalities 
over i: 


n—l m-I n~l m-l 


> PA Mig Ay, Ax; < j [ j f(x, y) dy} dx < z 2, Mik Ay, Axi 


i=0 k=0 


On noticing that dy, 4x; is the area of (Rix), we can say that the extreme terms 
of the inequality tend to integral (9) as the sub-rectangles diminish indefinitely, 
which proves the required formula: 


SS Kæ, y) de dy = J [ jie, y) dy} dz, (12) 
(R) 
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i.e. ¿f double integral (9) and iterated integral (11) exist, (12) is valid and the 
integrals are equal. 

We remark that the existence of integral (11) presupposes the existence 
of (10). If f(N) is continuous in the closed rectangle (FR), (9) and (10) clearly 
exist ([95] and [I, 116]). With this, as we saw in [80], (10) is a continuous func- 
tion of 2, so that (11) also exists. We now consider a domain (g), bounded by 
the curves y = p(x), y = 9,(x), and by the lines x = a, x = b (Fig. 83). We 

assume the existence of the double integral 


SIAN) do = f f f(@,y)dedy, (13) 
(2) (2) 
of the single integral 
p(x) 
F(x) = f K(x, y)dy (14) 
g(x) 
and of the iterated integral 


b @,(x) 
froda fi f Kæ y)dy]de. (15) 


a glx) 


Let (R) be the rectangle formed by the 
lines (8), where c, d are chosen such that 
c > p(x), d> px) for all x of (a, b), i.e. (a) 
forms part of (R). We define the function 
JAN) =f,(v,y) in (R), equal to f(N) at points of (o) and zero throughout 
the remainder of (R). The curves y = 9,(z), y = 9,(#) divide (R) into three 
parts: (a), and domains (I) and (II), lying below and above (0) (Fig. 83). We 
have f,(N) integrable in (a), since it coincides there with f(N), and integrable 
in (I) and (II), since it is zero at interior points of these domains. 
Hence f,(N) is integrable in (R) [96] and 


Fie. 83 


S$ AN) do =f f 4) do. (16) 
(R) (2) 
Similarly, for every x of (a, 3 we have the existence of 
p(x) 
-fnt y)dy = $ f(y) dy (17) 
lx) 


and of integral (15). 
It follows that (12) is valid for f,(N), and by (16) and (17), the reduction of a 
double integral over (g) to an iterated integral is obtained from (12) as: 


b plx) 
S $ Hz, y)dxdy =f{ f fz, y)dy]de. (18) 
0) a g(x) 


We assumed the existence of integrals (13), (14) and (15) for this argument. If 
f(x,y) is integrable in the closed domain (ø), integrals (13) and (14) exist, as 
above. Moreover, by [80], (14) defines a continuous function in v, and therefore 
(15) also exists. The formula for reducing a triple to an iterated integral, the 
latter consisting of three quadratures [58], can be proved in the same sort of way. 
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98. n-tuple integrals. All the remarks of [94] and [95] can be carried over 
directly to the case of n dimensional space, leading to the concept of the integral 
of a bounded function over a bounded measurable domain in n dimensions, as 
well as to the condition for integrability given above and to the usual properties 
of integrals. A formula exists, similar to that of [97], for reducing an n-tuple 
integral to an iterated integral, made up of n quadratures. The formula can be 
proved by induction, being varied by unity. The limits of the multiple integral 
are obtained from the inequalities that define the domain of integration. Let 
S(N) = f(x, £z .-+, Zn) be continuous in a closed measurable domain (Pn) inn 
dimensions, interior points of the domain being defined by the conditions: 
The points (x,, £a, ..., Gn) are interior points of a certain measurable domain 
Qn—1 Of (n — 1)-dimensional space, and the 2, satisfy the inequalities 


Pi (Lis Lore ey yg) <L En < Py (Lis Tor- +s Vpn), 


where pi(£i Xo, ..+,%p—,) and Pz(Tis Las -..,%_,_,) are continuous functions in 
Qn- The n-tuple integral is now given by a quadrature with respect to x, and 
an (n — 1)-tuple integral over (Q,_,): 


ss wee | Käis Beye. Bq) dg.. dzy = 
(Pr) 


Pa(X1)...,%n—1) 
= ff .f f Kais -es Ep) d2p] day. < «Alpar: (19) 
(Qn) P2(Xty.20) Xm) 


The generalization to n dimensions of a plane rectangle with sides parallel to 
the axes is the prismatoid (R,,), defined by: 


a, S T1 < bi; Qg < Za < bje; ay SEn < by (20) 


Integration over this prismatoid reduces to an iterated integral in which all the 
limits are constant: 


b, bn—ı bn $ 
SS... S Han. En) day... den= $ day... f dana | Kaise -s En) dep 
(Ra) a Anm an 


where the order of the integrations can be varied arbitrarily, provided the limits 
remain unchanged. 

The formula for change of variables in an n-tuple integral may be mentioned 
for the benefit of readers familiar with determinants. Let the variables 
(£i £a ..+, 2n) be replaced by new variables (2), 23, ..., £n), where 


Ti = qi (Ti, ©},...,2n) (6=1,2,...,0). (21) 
We now bring in the functional determinant of system (21): 
Op, pı dp 
Oxi? 3x3” ` Oap 
Op, Op, OP, 
D =| Ox’ 0a5°°°"’ Of |. (22) 
IPn IPn P IPn 
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The formula for change of variables runs 
S$... f fda. ..da,=J 9... [JD] dat... das, (23) 
(Pn) (P'a) 


where the inequalities defining the new domain of integration (P,) are obtained 
from those defining (P,,) by replacing the x, appearing there by expressions (21). 
The conditions for (23) to be applicable are the same as laid down for double 
integrals in [77]. Improper n-tuple integrals are defined in the same way as 
improper double and triple integrals [86]. We now turn to some examples. 


99. Examples. 1. A tetrahedron in n-dimensional space, bounded by the 
hyperplanes 


Tı = 0; z, = 0;...; 2, =0; 2, + 2+.. +z =a (a >0), 
s defined by the inequalities: 
gı > 0; z, >0;...; T> 0; s t2 tH. tHE (24) 


With n = 3, an ordinary tetrahedron is obtained, bounded by the coordinate 
planes and by the plane x + y + z =a. We introduce new variables by put- 
ting: 

a(z, +... + 2n) 


a=a,tat...4+ 293 wy == —2 
z n By ne e E T 
——_ a(tg-+... +2) es GEJA 
SS a ee Oey. ee a ee 
Let...ta, ' Ly-1 + Ly 


whence it follows that 


Byte. oy = ti; a(t, +.. .+ £r) = zi 23; 


n-1 


a(z, +.. H2) = 21 23 23... Aw, = wi 2Y.. wh. 


Conversely, the old variables are given in terms of the new by the expressions 


, a ta a, "A 
oe xi (a — x2) ese xi xz (a — x3) Fae 
a at eee 
‘ , LA L , , A 2 
_ © %2..-%n-1(a zrn) | Six... Eh 
Er-1 = ani ; Tr = ami 


It follows at once from these formulae that tetrahedron (24) can be replaced 
by the n-dimensional cube: 


0 < zi <a; 0< gri <a... 0< Th <a. (25) 


2. We find the measure (volume) of the n-dimensional sphere with centre 
at the origin and radius r, defined by 


z3 Haet. aire (26) 


If a transformation of similitude is carried out with coefficient k, the volume 
of a cube is multiplied by k”, and the radius of a sphere by k. It follows at once 
from this that the required measure v, is a function of r only, of the form 


v= 0r", (27) 
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where C,, is a numerical constant, different for different n. If sphere (26) is 
cut by a constant 2, plane, it is clear from (26) that an (n — 1)-dimensional 
sphere is obtained, the square of the radius of which is equal to (r?—2?). The 
volume of the new sphere will be C,,_,(r? — a2)("—via by (27). The part of the 
n-dimensional sphere lying between the planes x, and 2, + dz, will have a 
volume C,_,(r? — 2%)@-V/a dv, whence we have the following expression 
for vp: 
+r n—l 

Vn = Cyr" = Op- S (r?—a}) 2 dz, 
—r 


or, on substituting x, = r cos p, we have the following relationship between 
Cn and Oni: 


n 


n 2 
On =Cya Í sin" p dp = 20 p3 f sin” ọ dg, (28) 
0 0 


where, as we know from [I, 100), 


(n — 1) (n = 3)...1 


o 
— wa 


in” a re 

sin” 9 dg = nin — 2...2 5} for even n, 
n 
(n —1) (n — 3)...2 
ofa _(n—l)(n—3)... 

[sin pdp = nin — 2...3 for odd n, 


0 
On replacing n by (n — 1) in (28), we get: 
a 
z 
On-1 = 2C n-z Í sin"! g do. 
0 
It follows from the equations written that, for any integral n: 


O= 0 (29) 


But we know that C, = 2, C; = 42/3. Hence we have from (29): 


ea?” 
C,= na- JE for even n, 
gant) ge) 
O= for odd n. 


n(n — 2)...1 


CHAPTER IV 


VECTOR ANALYSIS AND FIELD THEORY 


§ 10. Basic vector algebra 


100. Addition and subtraction of vectors. The present chapter is 
mainly concerned with vector analysis. Since a number of specialized 
treatises are now available on the subject, we shall confine ourselves 
to the broad outlines, including only details directly connected with 
the previous matter and essential for our treatment of the founda- 
tions of mathematical physics. 

We encounter two types of magnitude, scalars and vectors, when 
studying physical phenomena. 

A scalar is a magnitude which, for a given choice of unit, is fully 
characterized by the number measuring it. 

For instance, the temperature at any point of a heated body in 
space is characterized by adefinite number, and we can, therefore, say 
that temperature is a scalar. Density, energy and potential are further 
examples of scalars. 

Velocity may be taken as an example of a vector. To characterize 
a velocity, we need to know its direction as well as the number meas- 
uring its magnitude. In other words, we need to construct a vector: 
a straight line whose length is equal to the magnitude of the velocity 
on a given scale and the direction of which coincides with the direction 
of the velocity. A vector is fully defined by its length and direction. 
Force, acceleration and impulse are also vectors. 

We return to our example of a heated body. The temperature u 
at any point of the body is characterized by a definite number and 
can be said to be a function of the points in space occupied by the body. 
If space is referred to a system XYZ of Cartesian coordinates, we can 
say that the scalar u is a function of the independent variables (x, y, z), 
defined in the domain of space occupied by the body. Here we have 
an example of a scalar field. 


298 
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If a vector is defined at every point of a certain domain, we get a 
vector field, a suitable example being the electromagnetic field, 
where the electric and magnetic forces are defined at every point. 

It becomes important in certain cases to know the point of appli- 
cation of a vector, i.e. the point in space at which the origin of the 
vector is situated. We shall be concerned in future, not so much with 
such tied vectors, as with free vectors, for which the point of appli- 
cation is arbitrary. We shall therefore look on two vectors as equal 
when they have equal magnitude (length) and the same direction. 


) / 
P E S, if 
A A / B 
= To=-B 
aS 
Fia. 84 Fie. 85 
Vectors will be written in heavy type A,B, ..., their respective 
magnitudes (lengths) being denoted as |A l, |B |, .. .; letters in normal 


type are used for scalars. 

Let A, B, C be three vectors. We draw A with its origin at a given 
point O, we draw B from the terminus of A, and C from the terminus of B. 
The vector S, with origin at the origin of the first vector and terminus at 
the terminus of the last vector, is called the sum of the given vectors: 


S=A+B-+C. 


Vector sums have the commutative and associative properties of 
ordinary sums, expressed by the formulae (Fig. 84): 


A+B=B+A; A+(B4+C)=(A+B)+C. 


If from the terminus of A, a vector € is drawn, equal in magnitude and 
opposite in direction to B, the vector M with origin at the origin of A and 
terminus at the terminus of C is called the difference between A and B 
(Fig. 85): 

M=A—B. 


This vector is easily seen to be fully defined by the relationship: 
B+M=A. 
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In general we write (—N) for the vector with the same magnitude 
but opposite direction to N. The difference between A and B can then 
be defined as the sum of A and (-—B), i.e. 


A+(—B)=A-—B. 


It is easily shown that the vector sum and difference thus defined 
are subject to the same rules as ordinary algebraic sums and differ- 
ences, and we shall omit the proof. 

The rule for vector addition has numerous applications in mechanics 
and physics. For instance, if a particle is subject to several types of 
motion, its final velocity is found by vector addition of the velocities 
due to each individual motion. Similarly, vector addition gives the 
resultant of several forces acting on a particle. 

If the terminus of the final vector in a sum coincides with the origin 
of the initial vector, i.e. if the step line given by the above construction 
is closed, the sum is said to be zero: 


A+B+C=0. 
In particular, we obviously have: 
A+(—A)=0. 


In general, a vector is said to be zero if its magnitude is zero; there is 
then no occasion for speaking of its direction. 


101. Multiplication of a vector by a scalar. Coplanar vectors. Given 
the vector A and the real number a, the product aA or Aa is defined as 
the vector of magnitude |a|- |A|, its direction being the same as A if 
a > 0, and opposite to A if a < 0. If a = 0, aA also equals zero. 

Hence, if A and B are two vectors with the same or opposite direc- 
tions, the relationship exists between them: 


B= nA, 
which may be written more symmetrically as: 


aA + bB = 0, 
on setting n = —a/b. 
Conversely, the existence of this relationship shows that A and B 
have the same or opposite directions. , 
Now let A and B be any two vectors whose directions are neither 
the same nor opposite. Let us draw two straight lines, parallel to the 
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given vectors, through an arbitrary point O (Fig. 86). These lines 
define a plane, which is parallel, not only to A and B, but to all mA 
and nB, where m and n are any numbers, and to their sum, by the 


addition rule, 
C= mA + nB. 


Conversely, any vector C parallel to the plane can be written in the 
form mA + nB. This may easily be seen by marking off C from O 
and representing it as the diagonal of a parallelogram with sides 
parallel to A and B. The above rela- 
tionship may be written in the more 
symmetrical form: 


aA + bB + cC= 0, 


and expresses the condition for three vectors to be coplanar, i.e. the 
fact that all three are parallel to the same plane. If A and B have the 
same or opposite directions, they are coplanar with any vector C, 
and we have to take c = 0 in the above relationship. 


102. Resolution of a vector into three non-coplanar components, 
Let A, B, C be three vectors not in the same plane. Any vector can 
be represented as the diagonal of a parallelepiped, the three edges 
of which are parallel to A, B, C. Thus, any vector can be expressed in 
terms of three non-coplanar vectors (Fig. 87): 


D =mA -+ mB + pC. 


It follows from this that a relationship exists between any four 
vectors of the form: 


aA + 6B+cC+dD=0. 


If the first three vectors are in the same plane, we have to take d = 0. 
The most important particular case of the above rule for resolving 
a vector into three others is when space is referred to rectangular 
coordinates XYZ and A, B, C are of unit length and are directed along 
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OX, OY, OZ respectively. They are then called unit vectors and 
are denoted by i, j, k. 
Any vector A can be written as 
A =mi +- nj + pk. (1) 


If A is taken from the origin of coordinates, the numbers m, n, p are 
the coordinates of its terminus and give the projections of A on the 
axes. We shall in future refer to these projections as the components 
of A along the axes and write them as Ax, Ay, A,. The above relation- 
ship can now be written as: 


A = A,i-+ A j+ A,k. (2) 
The projection of A along any direction n in space will be 
An = |A| cos (n, A) 
or, if we use the familiar expression of analytic geometry for the cosine 
of the angle between two directions: 
Án = |A| [cos (n, X) cos (A, X) + cos(n, Y ) cos (A, Y) + 
+ cos (n, Z) cos (A, Z)] = A, cos (n, X) + 
+ 4, cos (n, Y ) + A, cos (n, Z). 


When vectors are added, their components are obviously added (the 
projections of the closing side of the vector polygon are equal to the 
sums of the projections of the components). 


103. Scalar product. The scalar product of two vectors A and B is 
defined as the scalar equal to the product of the magnitudes of the vectors 
multiplied by the cosine of the angle between them. 

We write the scalar product as A - B, so that 


A - B = |A| |B| cos (A, B). (3) 
It follows at once from the definition that 
A-B=B.A, 


i.e. scalar products obey the commutative law. 
If A and B are perpendicular, clearly 


A-B=0. 
In particular, we have for the unit vectors: 


i-joj-k=k-i=0. 
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If A and B have the same direction, 


A-B = |A| |B], 
whilst if their directions are opposite, 
A-B = — |A| |B]. 
In particular, 
A-A = |A| = A}? + A} + A? (4) 
and 
i-f=j-jok-k=1. (5) 


The scalar product is given in terms of the components as follows: 


A-B = |A||B| cos (A, B) = |A! |B} [cos (A, X) cos (B, X) + 
+ cos (A, Y) cos (B, Y) + cos (A, Z) cos (B, Z)] = 
= |A] cos (A, X) |B| cos (B, X) + |A| cos (B, Y ) |B| cos (B, Y) + 
+ |A| cos (A, Z) |B] cos (B, Z) = A, B; + A, B, + A, B» 


(6) 


i.e. the scalar product is equal to the sum of the products of corresponding 
components of the vectors. 

It may be noted that the obvious independence of the left-hand 
side of the above equation on the choice of axes implies the not-so- 
obvious independence of the right-hand side on the axes. 

We have made use in (6) of the familiar expression of analytic geo- 
metry for the angle between two directions [102]. 

A scalar product may easily be seen to obey the distributive law, 
i.e. we have 

(A+ B)-C=A-B+B.C. (7) 

We can use the expression just obtained for a scalar product to write: 

(A + B)-C = (A, + B,)C, + (Ay + By) 0y + (A, + B2) C: = 
= (A,C, + AyC, + A,C,) + (B,C, + B,C, + B,C,) = 
=A-C+B.-C. ; 

The distributive property gives us at once the more general formula 

(A, + B,)- (Ag + B2) = A, - Ag + A,- B + B,- A, + B,-B,. (8) 


which expresses the ordinary rule for removing brackets when cross- 
multiplying. 


104, Vector products. We draw two vectors A and B through any 
point O of space and construct a parallelogram on them. The perpen- 
dicular through O to the plane of the parallelogram can have either 
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of two opposite directions. We take the direction with the property 
that, for an observer standing along it, the rotation of less than x 
required to turn the direction of A into that of B has the same sense 
as the rotation of 2/2 needed to turn the positive direction of OX 
into that of OY for an observer standing along OZ. The direction of 
the perpendicular is shown in Fig. 88 for right- 
and left-handed systems of axes. 
The vector product of A and B is defined 
_ as the vector of magnitude equal to the area 
” of the parallelogram constructed from A and 
B and directed along the perpendicular as 
above defined to the plane of the parallelogram. 
The vector product is generally written 
symbolically A x B. Its magnitude is, by 
definition: 


right 


left 


Fra. 88 


|A] [B|sin (A, B). (9) 


Its direction depends on the orientation of the axes, and if this is 
changed, its direction is reversed. 

The vector product is zero if A and B have the same or opposite 
directions. In particular, obviously: 


AXxA=0. 


We now consider the vector product of B and A. Its magnitude is 
clearly the same as for the product of A and B, whereas it has the 
opposite direction, since transposition of A and B means rotation of B 
only, and not A, in the reverse sense. Thus: 


BxA=—AxB, (10) 


whence it is clear that the commutative law does not apply; in fact, 
interchange of the factors in a vector product changes tts sign. 
We have the obvious relationships for the unit vectors: 


ixi=jxj=kxk=0; jxk=i; kxi=j; ixj=k. (l1) 
We next express the components of the vector product P = A x B 


in terms of the components of A and B. We can write, since A x B is 
perpendicular to A and B: 


P,A,+ P, Ay+P,4,=0, P,B,+P,B,+P,B,=0. 


We now use the following elementary algebraic lemma. 
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Lemma. The solution of the two homogeneous equations in three 


variables 
ax + by +cz=0; agz+by+tez=0 


has the form 
x = A(be, — cb); y= Aca, — aci); 2=A(ab, — ba,), 
where 4 is an arbitrary factor and it is assumed that at least one of the 


differences in brackets is not zero. 
We leave the proof of this simple lemma to the reader; it gives us Tt: 


P= (A, B: — A, By); Py = A4, By — A, Ba); 
P; = AA; By gi A, B,), 
where å is a coefficient of proportionality which must still be defined. 
For this, we use an important additional identity, generally known 
as Lagrange’s identity: 
(a? +b? + o?) (at + BF + ch) — (aa, + bb, + cc,)? = 
= (be, — cb)? + (ca, — ac,)? + (ab, — ba,)*, (12) 


which is easily verified by removing the brackets on both sides. We 
also notice that (PZ + P3 + P?) is the square of the length of P, i.e. 


a? [(Ay B, a A, By)? F (A, B, Ta A, B,)? F (A, B; =o Ay B,,)"] = 

= |A|? |B)? sin? (A, B). 

We apply Lagrange’s identity to the left-hand side of this last equation, 

which gives us 

2? (LAE + A5 + 43) (BE + By + Bi) — (A,B, + Ay By + A, B,)?] = 

= |A|? |B|? sin? (A, B), 

or, on using (4) and (6): 

A? [ JA]? |B]? — |A]? |B)? cos? (A, B)] = |A|? |B|? sin? (A, B), 
whence it follows at once that A = +1. 
We finally show that A = +1. We subject A and B to continuous 


deformation, such that A coincides with the unit vector i, and 
B with j. The deformation can be carried out in such a way that 


t If all three of the differences in brackets are zero, the angle between A 
and B is 0 or z, and AxB = 0, i.e. P, = Py = P,=0. 
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A and B do not vanish and do not become parallel to each other. 
Then A x B will not vanish and will finally become 
ixj =k, 
since A coincides with i, and B with j. 
Since the change is continuous and A can only have two values (41), 


we can say that å will not change during the deformation, and hence 
it must keep its final value. But we have after deformation: 


A,=1; Ay=A,=0; By=1; B,=B,=0; 
P,=1; P,=P,=0, 
and we can conclude from the relationship 


P, = (A, By — A, B,) 


that 2 = +1. 
We thus get the following expressions for the components of A x B: 
A,B,—A,B,; A,B,—A,B,; A,B, — A, By (13) 


By means of these expressions, the reader may easily verify that 
vector products obey the distributive law, i.e. that: 


(A-+B)xC=AXC+BxC. (14) 
Hence we find, on using (10): 
cx(A+B)=C x A+CxB, 
which leads on to the more general formula: 
(A, + Ag) x (B; + By) = 
= A, X B; + A, X B, + A, X B, + AX Ba}, (15) 


which is the exact analogue of (8) for scalar products. 


105. The relationship between scalar and vector products, We form 
the scalar product of the vector A with the vector product N = B x C: 


A-(BxC). 


The magnitude of N is given by the area of the parallelogram formed 
from B and C. But 


A-(BXC) = A-N = |A| |N] cos (A, N), 


and the scalar product can therefore be looked on as the product of 
area | N | of the parallelogram and the projection of A on the direction 
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of N, which is perpendicular to the area, i. the scalar product 
A -(B x C) expresses the volume of the parallelepiped formed from A, B, 
and C. Its sign depends on the orientation of the coordinate system. 
It is easily seen to be (-+) if the set of vectors B, C, A or, what amounts 
to the same thing, A, B, € has the same orientation as the axes. We 
can prove this by the method of continuous deformation used abovef. 

When referring to the volume of the parallelepiped, we took as its 
base the parallelogram formed from B and C. We might equally well 
have taken the base as the parallelogram formed from € and A or 
from A and B. We thus have the following relationships: 


A-(BxC) = B-(Cx A) =C-(AxB). (16) 


It only remains to notice the signs of these scalar products; they are 
the same, since the sets (A, B, C), (B, C, A), and (C, A, B) have the 
same orientation. The two latter sets are acquired from the first by a 
cyclic change. A different order changes the sign, e.g. 

A-(BxC) = —B.-(Ax(). (17) 

If A, B, C are coplanar, the volume of the parallelepiped is zero, i.e. 

A-(BxC) = 0. (18) 


This equation ts the necessary and sufficient condition for three vectors 
A, B, C to be coplanar. 
We now consider the vector product of A and B x C, ie 


D=Ax(BxC). 


Since D is perpendicular to B x C, it is coplanar with B and C, and 
hence [101]: 
D = mB + 7C; (19) 


but D is also perpendicular to A, and hence [103]: 
A-D=mA-B+7A-€C=0, 
so that 


m = pPA-C; n = — uA. B, 


t The dependence of the sign of A.(BxC) on the orientation of the axes 
comes from the dependence of BxC on the orientation. Hence A- (B xC) is 
not an ordinary scalar, the magnitude of which has to be independent of the 
choice of axes. In general, quantities that depend on the axes only to the extent 
that a change of orientation implies a change of sign are called pseudoscalara. 
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and finally 
Ax (BXC) =D = p{(A-C) B— (A-B)C}, 


where it only remains to find the coefficient of proportionality u. 
For this, it is enough to compare the components along one of the axes 
of the vectors on the left and right-hand sides of the last equation. 
We take OX parallel to A and find the components along OZ. On noting 
that with this choice of axes 


A,=|Al=a; A,= 4,=0, 
we have for the left-hand side [104] 
D, = A, (BXC), = a(B,C, — B,C,), 
and for the right [103] 
ulaC, B, — aB, C,), 


on equating which, we get u = 1. 
This leads us to the following expression: 


A x(BxC) = (C-A)B— (A-B)C. (20) 


As a corollary of this, we can resolve a vector B into two component 
vectors, parallel and perpendicular to a given vector A. We put C =A 
in (20) and re-write it as 


(A-A)B=(A-B)A—Ax(AxB) 

or 
B=B' +B’, (21) 
where 
,_ A-B i. pr _ AX (AxB) 

B=7 4 B= Ser ay Ga 
which gives the required result, since clearly, B’ is parallel to, and B” 
perpendicular to, A. 


106. The velocities at points of a rotating rigid body; the moment of a vector. 
Vector products have important applications in mechanics, primarily in the 
dynamics of a rigid body.t 

We first consider a rigid body rotating about a fixed axis (L). In this case, 
every point M of the body has a velocity v which is equal in magnitude to the 
product of the distance PM of M from the axis of rotation (Fig. 89) and the 
angular velocity œw of the rotation, and which has a direction perpendicular 
to the plane containing M and the axis of rotation. The velocity v can be repre- 
sented geometrically as follows. We choose the direction along (L) with respect 


+ We use a right-handed system of axes in future. 
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to which the rotation is counter-clockwise and call this positive. If we mark off 
a length equal to w in this direction from an arbitrary point A of the axis, 
we get a vector o called the angular velocity vector. We let r denote the vector 
along AM and use the definition of vector product; this gives us the simple 
expression for v: 

v=0XT, 


since the magnitude of ox r is equal to 
|r| |o! sin (r, 0) = w- |MA|- sin p = œ |MP]| = |v], 


whilst its direction is the same as that of v. 

We know from kinematics that, in the case of the motion of a rigid body 
about a fixed point O, the velocity of any point of the body at any given 
instant is the same as it would be if the body rotated about an axis (the instan- 


M 
r F 
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taneous axis) through the point O with an angular velocity w (the instantaneous 
angular velocity); generally speaking, the position of the axis and the value 
of w change with time. By the above, the velocity of a point of a rigid body is 
given at any given instant by the vector product of the instantaneous angular 
velocity and the vector OM. 

We take a second example. Let a force represented by the vector F be applied 
at the point M, and let A be a given point of space (Fig. 90). 

We define the moment of the force F about the point A as the vector product 
F xr, where r is the vector with origin at M and terminus at A. 

Let AP be the perpendicular from A to the line along which F lies. We get 
from the right-angled triangle AMP: 


[AP] = |r| |sin (r, F)| 
so that the magnitude of the moment of the force F about A is 
|r| IF| sin (r, F)| = [F] [AP], 


i.e. it is equal to the product of the magnitude of the force and the distance of the 
point A from the line along which the force acts. The direction of the moment is 
found by using the above rule for the direction of a vector product. 

It follows from what has been said that the moment of a force does not vary 
with movement of the point of application M along the line of the force. The 
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definition of the moment of a force about a pont can evidently be generalized 
for the case of any vector. 

We now find the components of the moment. Let (a, b, c) be the coordinates 
of A and (x, y, z) the coordinates of M. The components of r are: 


a— x, b—y, c—z. 

We use the expressions for the components of a vector product [104] to 
obtain the components of the moment as follows: 

(y — b) F,—(¢-e) Fy; (@-0) PF, —(e@—a) Fy (x —a) Fy —(y— 0) Fy. 

Returning to the example of a rigid body rotating about an axis, we can 
say that the velocity of a point M of the body is equal to the moment of the 
angular velocity vector about M. If we take the coordinates of M as (x, y, z), 
whilst (x9, Yo Zo) is the origin of the angular velocity vector, with components 
Ox, Oy, O,, we have the following expressions for the components of the velocity 
of M: 

(2 — 25) Oy — (Y — Yo) Oz; (Œ — £.) Oz — (2 — 20) Ox; (Y — Yo) Ox — (Œ — Bp) Oy. 

We now define the moment of a vector about an axis. Let A be a straight line 
in space to which a definite direction is assigned (the axis). 

The moment of a vector F about the axis A is defined as the algebraic value of 
the projection on the axis of the moment of F about any point A of A.t 

The definition is justified by showing the independence of the projection of 
the position of A on A. We take 4 as axis OZ and let (0, 0, c) be the coordinates 
of A and (zx, y, z) the coordinates of the origin M of F. With this choice of axes, 
the projection on A of the moment of F about A is the same as its component 
along OZ, and is equal, by the above formula, to 

xFy — yF,, 


since a = b = 0. This is independent of c, i.e. of the position of A on A. 


§ 11. Field theory 


107. Differentiation of vectors. We extend differentiation to the 
case of a variable vector A(z), depending on a numerical parameter T. 
We shall take the vector from a fixed point, say the coordinate origin 
O (Fig. 91). As t varies, the terminus of the vector A(t) describes a 
certain curve (L). Let the positions of the vector for the values of the 
parameter (r + Art) and t be respectively OM, and OM. The straight 
line MM, corresponds to the difference A(z + At) — A(t), and the 


ratio 
A(t + dt) — A(t) 
“At 


t The moment of a vector about a point is a vector, whilst the moment 
about an axis is not. 


107] DIFFERENTIATION OF VECTORS 311 


gives us a vector parallel to MM. The limiting position of this vector 
as At — 0, if it exists, represents the derivative 


= lim seu A(t) ; (22) 
This derivative is evidently a vector directed along the tangent to 
the curve (L) at M. It also depends on 7, and its derivative with 
respect to t gives a second derivative d?A(t)/dt? and so on. 
We can write A(r) in terms of the 
unit vectors i, j, k: (L) 


A(t) = A, (t)i + Ay (t)j + A, (t)k. 


ene 


Definition (22) now gives 


dA(t) _ dAx(t), dA, (1) 


då, (T). 
dr dt a de j + dr k A(T+ AT) 
(23) Fie, 91 
and in general 
d™ A(r) a™ A(t). d™Ay(z). d” A, Ok 
d gym” t an ear dı” i (231) 


i.e. differentiation of a vector amounts to differentiation of its components. 

The familiar rule for differentiation of a product may be extended 
to the product of a scalar and a vector and to scalar and vector pro- 
ducts, in accordance with the formulae: 


-$ Aam) = YO Aa) + py) AO (24) 
T Alt) Bir) = sae + B(r) + A(t): (24,) 
F Alt) xB(r) = “AO. x Ber) + Ale) x BO, (245) 


where f(z) is a scalar, and A(z), B(z) are vectors, eee on T. 
We verify say (24,); the left-hand side may be written: 


E {4 (1) B, (T) + Ay (1) B, (T) + A, (1) B, (2)} = 
= O Be) HO B, (o) + OB, (o) + Ae) RO + 


dal dB; (1) 


+ A, (1) + A, (t) 29 


The same result may be an seen to be obtained for the right-hand 
side. Of course it is assumed that the derivatives exist. The existence 
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of the derivatives in the factors of (24), (24,), (24,) implies the existence 
of the derivative of each product [cf. I, 47]. The proof of the ordinary 
rule for differentiating a vector sum is purely elementary. If a point M 
moves along a curve (L), the radius vector r of M is a function of time t. 
We get the velocity vector of the point on differentiating the radius 
vector with respect to ¢: 

dr ds dr 


rsa a a (25) 


The length of this vector is given by the derivative of the path s 
with respect to time t, whilst its direction is tangential to (L). The veloc- 
ity vector also depends on time and we can differentiate it to get the 
acceleration vector w = dv/dt. 

If we take the length s of the curve as the independent variable, 
the derivative of x with respect to s is represented by the unit vector along 
the tangent t=dr/ds. We saw in [I, 70] that /Aa?+ Ay2/4s—1, that is, 
the ratio of length of chord to length of corresponding arc tends to 
unity. The same is evidently true for curves in space [I, 160]. It 
follows at once from this and definition (22) with r = s that the length 
of the above tangential vector is in fact unity. 


108, Scalar field and gradient. If a physical quantity has a definite 

value at every point of the whole or part of space, a field of the quan- 

tity is defined. The field is said to be sca- 

(t) lar, if the quantity is scalar (temperature, 

pressure, electrostatic potential), or vector 

[100], if the quantity is a vector (veloc- 
M: ity, force). 

We start with a scalar field, which is 


M defined simply by defining a function of 
My a point U(M) = U(x, y, 2). 

M i) A heated body, for instance, gives a scalar 

temperature field. The temperature U (M) 

Fre. 92 has a definite value at any point M of the 


body, and can vary from point to point. 

We take a definite point and draw a straight line through it with an 
assigned direction (l) (Fig. 92). We consider the value of U (M) at the 
point M and at a neighbouring point M, on (l). The limit of the ratio 


U(M,) — U(M) 
MM, 
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is called the derivative of the function U(M) with respect to the direction 
(l) and is written as: 
OUM) iim TM) — UM) 
əl M;>M MM, d 


(26) 


This derivative characterizes the rapidity of variation of U(M) at 
the point M in the direction (l). The function thus has an unlimited 
number of derivatives at any given point; but it is easily shown that 
the derivative with respect to any given direction is expressible in 
terms of the derivatives with respect to the three mutually perpendic- 
ular directions X, Y, Z in accordance with: 


M 0U(M 0U(M 
aut _— M) cos (1, X) + s ) oos (2, ¥) + 


sum) 


+ cos (1, Z). (27) 


We remark first of all that we could have formed derivative (26) by 
taking any directed curve (L) through the point M instead of a straight 
line (Fig. 92). Instead of (26), we should have had to consider the limit 


. U(M,) — U(M) 
lim ea) UM 
M; >M MM, 


This is, in fact, clearly the derivative of U(M) with respect to the 
length of arc s of the curve (L) and we can write, on using the rule 
for differentiation of a function of a function: 


U(M,)— U(M) _ 9U(M) dz | @U(M) au 


kaaa U MM, ðr de T oy + 
ƏU(M) dz 
+ x L. ‘ds ° (20) 


But we know from [I, 160] that dx/ds, dy/ds, dz/ds are the direction- 
cosines of the tangent to (L) at M and when (L) is a straight line, we 
again get (27). Furthermore, (28) shows that the derivative with 
respect to the curve is the same as the derivative with respect to the 
tangent (m) to the curve at M. 

We now bring in the idea of level surfaces of a scalar field. These 
surfaces are characterized by the condition that U(M) has the same 
constant value C at all points of the surface. On assigning different 
numerical values to the constant, we get a family of level surfaces 
U(M) = C, with a definite surface passing through any given point 
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of space. In the case of a heated body, the level surfaces are those of 
constant temperature. Let (S) be the level surface passing through 
the point M (Fig. 93). We take three mutually perpendicular directions 
through M: the normal (n) to (8), and two directions (4), (t) lying 
in the tangent plane. The directions (t), (t) will be tangential to curves 
(L), (L) respectively, lying on the level surface. Since U (M) is constant 
along these curves, we have: 


ƏU(M) _ OU(M) _ 
0, dk =A, (29) 


We now take any direction (l). We use (27) with (n), (f,) and (t) as 
the three mutually perpendicular directions and take (29) into account, 
which gives: 


3U (M) _ IUM) 


al on 


oa If we draw a vector along (n) of al- 

js gebraic magnitude 3U (M)/Ən, its pro- 
jection on any direction (l) gives the 

derivative 0(U)M/al, by (30). 

The vector thus drawn is called the 
gradient of U(M), i.e. the gradient of a 
scalar field is, by definition, the vector 
field obtained as follows: the vector at 
any point is along the normal to the 
corresponding level surface, whilst its algebraic magnitude is equal to the 
derivative of U(M) with respect to the normal. The gradient of the 
scalar field U(M) is written grad U(M), and we can write (30) as: 


avan 


(ts) 


cos(l, n). (30) 
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= grad, U (M), (31) 


where grad, U(M) is the projection of the vector grad U (M) on (}). 

It is easily seen that the choice of the direction of normal (n) to the 
level surface (8) does not affect the direction of grad U (M). The latter 
vector is always in the normal direction to (S) in which U(M) is 
increasing. 


Examples. 1. The gravitational field discussed in [87] leads to a scalar field 
of the gravitational potential 


von = p peapa. 
©) 
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where „(M,) is the density of the matter occupying volume (v) and r is the 
distance of the point M from the variable point M, of integration. We had the 
following expressions for the components of the gravitational force: 


_ @U(M) 8U(M) p, — UM) 
~ ds” Oy 


Á y=- * Paoa > 


x 


where F,, F,, F, are the components of the vector F. It follows at once that, 
in general, F, = 0U(M)/dl, i.e. the vector field of the gravitational force is 
the gradient of the potential U(M). The work done by the force is given by 


(B) (B) 
f F, dx + Fy dy + F; dz = f dU(M) = U(B) — U(A), 
(A) (A) 


i.e. the work is given by the potential difference between points A and B. 

Every conservative field of force, i.e. every field for which F = grad U(M), 
clearly has this property. We often call — U(M) the potential, instead of U(M) 
itself. 

2. If different points of a body have different temperatures U(M), movement 
of heat will take place in the field from hotter to cooler points. Let dS be a 
small element about M of any surface through M. We find from thermodynamics 
that the quantity of heat crossing the element dS in time dt is proportional 
to dt dS and the normal temperature derivative 0U(M)/@n, i.e. 


AQ =kdtds ea j (32) 


where k is the coefficient of proportionality called the internal thermal con 
ductivity and (n) is the normal to dS. 

We now draw the vector —k grad U(M), called the heat flow vector; the (—) 
sign is used because heat flows from points of higher to lower temperature, 
whilst grad U(M) is along the normal to the level surface in the direction of 
increasing U(M). We can say by (32) that the quantity of heat AQ passing in 
time dt through the element dS is given by: 


AQ = k dt AS |grad, U (M)]. (33) 


109. Vector fields. Curl and divergence. We now consider a vector 
field A(M), obtained when a vector A(M) is defined in magnitude and 
direction at every point of the whole or part of space occupied by the 
field. In fluid flow, for instance, we have a vector field of the velocity 
v at any given instant. 

We define a vector line of the field as a curve (L) such that the 
tangent at every point has the direction of A(M) (Fig. 94). It may 
easily be seen, as in [22], that a vector line has a differential equation 
of the form 


Ag, Ags ee? (34) 
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where the components Ax, Ay, A, are known functions of x, y, z. If the 
conditions of the existence and uniqueness theorem are observed, it 
follows from this that one and only one vector line passes through 
any point Mt. If we draw all the vector lines through all the points 
of a piece of surface (8), their aggregate gives us a vector tube 
(Fig. 94). 


A(M) 


_ Let (v) be a volume in the field bounded by the surface (S) and let 
(n) be the direction of the outward normal to (S) from (v). We find on 
applying Ostrogradskii’s formula [63] to functions A,, Ay, A,: 


ME +e + ae 


= f a cos(n, X) + Ay cos (n, Y) + A, cos (n, Z)] dS 
(S) 


or [102] 


f nies aa + oe + os =f) de =| { Anas. (35) 
(v) (sj 


The surface integral on the right is usually called the flux of the field 
through the surface. Its physical significance is explained later. The 
integrand of the volume integral is called the divergence of the vector 
field and is written 


, 3A aA Os 
divA = aan + By + : (36) 


t The conditions of the theorem are certainly satisfied if A,, Ay, 4, are con- 
tinuous and have continuous derivatives, whilst A(M) differs from zero at 
M, i.e. at least one component differs from zero [51]. 
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We can thus write Ostrogradskii’s formula as 


C div À dv = AdS, 

ifs {fn on 
i.e. the volume integral of the divergence is equal to the flux of the field 
through the surface of the volume. Definition (36) of divergence is related 
to the choice of axes X, Y, Z, but an independent definition is easily 
obtained with the aid of (37). Let (S,) be the surface of a small volume 
(v,) surrounding the point M. If we apply (37) and use the mean value 
theorem [61], we can write 


{f Anas 
div A| -v,=[[AndS, ie dival, =. 
M, > M, Ti 
(S3) 
where the value of div A is taken at a point M, of volume (v,), the 
magnitude of which is denoted by v,. On indefinite contraction of the 
volume to the point M, M, tends to M, and the above expression 
gives in the limit the value of the divergence at M: 


A, aS 
divA= lim ints i (38) 
(i) >M vı 
i.e. the divergence of the field at the point M is the limit of the ratio of the 
flux through a small surface surrounding M to the volume enclosed by this 
surface. 

The above discussion shows that every vector field A gives rise to a 
scalar field div A, i.e. the field of its divergence. We show next that, by 
using Stokes’ theorem, we can generate a unique vector field from 
the initial field A. We take 


P=A,; Q= 4; R=A,, 


and write Stokes’ formula [70] as: 


0A 0A 
[Ayaz + Aydy + A,dz ={f [( By — S2) cos (n, X) + 
1) (S) 
0A, 0A 0A aA 

+ ( a Ba) cos (m, Y)+ (52 — ra cos (n, 2)| ds. (39) 

Let ds be a directed element of arc of the curve (l), i.e. an element 
of arc considered as a small vector. Its components on the axes will be 
dz, dy, dz, and the expression under the line integral sign con- 
sists of the scalar product A - ds, i.e. is equal to A, ds, where A, is the 
projection A on the tangent to (l). 
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We introduce another vector, with components equal to the differ- 
ences appearing under the double integral sign. This vector forms the 
new vector field and is called the curl of field A, being written curl A 
thus: 


. __ ðA: Oy, , _ 04, 04, | 2 ðAy GAs 
curl, A = By eS curl A = a ee curl, A = 3 > — Jy 
(40) 


We can now write (39) as: 


fA, ds = f f [curl Acos(n, X) + curl,Acos(n,Y) + curl,Acos(n, Z)]d8 
© (S) 
or 
f A,ds = f f curl, AdS, (41) 
(0) (S) 
where curl, A is the component of curl A along the normal (n) to (8). 
The line integral on the left is usually called the circulation of 
vector A round the contour (l) and Stokes’ formula can be stated thus: 
the circulation of the field round the contour of a given surface is equal to 
the integral over the surface of the normal component of the curl, i.e. 
equal to the flux of the curl through the surface. We can define curl 
independently of the choice of axes with the aid of (41). Let (m) be a 
given direction through the point M, and let (o) be a small plane 
surface through M normal to (m). We apply (41) to this area and use 
the mean value theorem: 
. f A,ds 
f A,ds = curlm Alm, +0, ie. curl, Aly, =“ —, 
@) 

where (A) is the contour of (a) and M, is a point of the area. On indefi- 
nite contraction of the area to the point M, we get in the limit, as in 
the case of divergence, the value of the component of curl in any given 
direction (m) at the point M: 

f A,ds 


curl, A= lim ® _. (42) 
(o0}>M (ed 


Numerous examples will occur later of applications of curl and 
divergence, when the physical meaning of these will be explained. 


110. Lamellar and solenoidal fields. We obtained in [108] the vector 
field grad U (M) as the gradient of a scalar function U (M). Such a field 
is called a lamellar field. Of course, not every vector field is a lamellar 
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field, and we now give the necessary and sufficient conditions for the 
field to be lamellar. The relationship A = grad U(M) is equivalent 


to [108] 
aU. 4 _U, 4, _ 


Ox’ = 


A, = y Cy’ zoe? 


i.e. to the fact that the expression 
A, dx + A,dy + A,dz (43) 


is the total differential of a function. We saw in [73] that there are 
three necessary and sufficient conditions for this: 
04, O4y_,, BAr O42 _ p, Ody Ay _ 


oy dz » “Bz Oc ° os y ve 


which are in turn equivalent to the vanishing of the curl of the field: 
curl A = 0, ie. the necessary and sufficient condition for a vector field 
to be lamellar is that its curl vanishes. If this condition is satisfied, the 
potential is defined by [73] as a line integral: 


(™) M) 
U(M)= f A dr + 4,dy + A,dz= | A,ds. (44) 
(Mo) (Mo) 
With this, A = grad U (M), and [73] 
e A, = (grad, U (M)ds =U (B) — U (A). 
(A) (A) 


We can suppose. that expression ‘ast whilst not itself a total differ- 
ential, has an integrating factor, i.e. there ‘exist the function Bae! ) 
of the point M such that 


H(A, dx + A, dy + A, dz) = dU (45) 


is a total differential. We speak of a field of this sort as compound 
lamellar. We saw in [76] that the characteristic of such a field is the 
existence of the family of surfaces U(M) = C orthogonal to a vector 
line of the field, whilst it follows from (45) that uA = grad U or 


A = ~=- grad U, 


i.e. the field A in this case differs from the lamellar field by a numeri- 
cal factor 1/u, with different values at different points of space. 
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The necessary and sufficient condition for a compound lamellar 
field is given by [76]: 


e 4, (Qe — a —o 


which can be written as 
A-curl A = 0, (46) 


i.e. the necessary and sufficient condition for the existence of a family of 
orthogonal surfaces to a vector line of the field is given by (46), i.e. 
the vectors A and curl A are perpendicu- 

lar or else curl A vanishes. 
If the space occupied by the field is 
multiply connected, the field potential 
Fre. 95 defined by (44) may be a many-valued 

function. 
We investigated above a vector field, 
the curl of which is zero, and discovered that the field is lamellar. A 
field for which the divergence is zero, i.e. where we have identically 
div A = 0, is called solenoidal. We have by (37) for this sort of field: 
ff[A,d8 = 0, (47) 
_ &) 
where (9) is any closed surface inside which the field exists everywhere. 
Let (S) be the part of a vector tube between its two sections (8,) 
and (S,) (Fig. 95). Since A lies in the tangent plane to the lateral part 
of (S), we have here A, = 0. If we take the normal (n) for both the 
sections (S,) and (S,) in the direction of movement along the tube, we 
shall get the inward normal in the case of (S,) and the outward normal 
for (S,) with respect to the piece of tube (3). We have on using (47): 
{f 4ds — f {4ds =0, 
(S,) (S) 


where the integral over (.8,) has the (—) sign since (n) is in the opposite 
direction to the outward normal. This equation shows that 
ff Anas (48) 
(S) 
has the same value for every section (S) of a vector tube in the case 
of a solenoidal field. The integral represents the flux of the field through 
the section (S) and is usually called the strength of the vector tube at 
section (S). Hence we can say that the strength is the same at all sections 
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of a vector tube in the case of a solenoidal field. If the cross-sectional 
area increases on moving along a tube, i.e. the tube expands, the value 
of the flux intensity A, generally speaking diminishes, so that the 
value of integral (48) remains unchanged. 


111. Directed elementary areas. We now bring in directed elementary 
areas dS, corresponding to the directed elementary arcs of [109]. We 
suppose that two sides have been distinguished of a given surface, 
such that there are two opposite normal directions at any point of the 
surface, depending on which side is taken, and where continuous 
variation of either one direction is implied by continuous motion over 
the surface [64]. If the surface is closed, we get the inward and outward 
normals with respect to the volume enclosed by the surface. We define 
a directed elementary surface dS as the vector of length equal to the area dS 
of the elementary surface and with direction along the normal (n) to the 
element. We agree to take (n) as the outward normal in the case of a 
closed surface, where the inward normal will be written (n). 

The projections of dS on the axes give us the projections of the ele- 
mentary area on the corresponding coordinate planes, with the sign 
plus or minus, depending on whether the angle between (n) and the 
axis concerned is acute or obtuse. 

Let f(M) and A(M) be scalar and vector functions defined on the 
surface (S). We take the expressions: 


P AM) aS (49) 
| f A(M)-dS (49,) 
S 
Ja A(M) x dS. (49,) 


The first is a vector with the components: 
$ f At) cos (n, X) a8; § { ADD) cos (n, Y) d8; 
(S (S) 
5 § AM) cos (n, Z) as. 
(S) 
Expression (49,) is a scalar: 


J {A-d8 = J f 4,48 
(S) (S 
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whilst finally, (49,) is a vector with components: 

SJ [A, cos (n, Z) — A, cos (n, Y )}] d8, 

SJ [ A, cos(n, X) — A, cos (n, Z)] dS, 

[f [4,008 (m, Y) — A, cos (n, X)] A8. 


Let (S) be closed and let (v) be the enclosed volume, /(J/) and A(M) 
being defined throughout the volume. By applying Ostrogradskii’s 
formula, we can easily verify the following three equations: 


Jf tas = S f S grad dv (50) 
JJ Aas = J f Sdiva dv (50,) 
fJ Axas = — f f f curl A dv. (50,) 


Equation (50,) is the same as (37). Equation (50,) is verified as 
follows. The components of the left and right-hand sides along the x 
axis are given by the integrals: 


f [[4y cos (n, Z) — A, cos (n, Y)] dS; — E = 2r) dv, 
(S) v 
which are easily seen to be of the same magnitude by transforming 
the double integral with the aid of Ostrogradskii’s formula [63]. 
Similarly, by using Stokes’ formula and a directed elementary area, 
we can write the following expressions: 
f fds = — f f grad f x dS (51) 
® (S) 
f A-ds = f f curl A- ds. (51,) 
9 (S) 
Here, (8) is a given surface and (l) its contour. The second formula is 
the same as (41), since curl A- dS = curl, 4 * dS by definition of 
scalar product. The components along OX of the left and right-hand 
sides of (51) are: 


0 ə 
| fae; -f {lee cos (n, Z) — ÊL cos (n, Y)] dS ; 
(0) (S) 
these expressions are easily shown to be equal using (22) of [70]. 
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112. Some formulae of vector analysis. We prove certain relationships 
between the vector operators introduced. We saw in [110] that the 
curl of a lamellar field is zero: 


curlgrad U = 0. (52) 
A rotational field is easily seen to have zero divergence: 
div curl A= 0. (53) 
We have, in fact: 
0A 0A, 0A 0 (0A CZ: pa __ 
div oul A= = (% z SY) +4 By (S at) + oe (at — Be) =o. 
We next consider the divergence of a lamellar field: 
div grad U = Č grad, U Far ~= grad, U + = aa grad, U, 
or 
. 2U &U 2U 
div grad U = > t oye + oe" (54) 
. The differential operator 
eU eU eU 
AU = ga t 47 Dy T Gar (55) 


is called Laplace’s operator. From the left-hand side of (54), it is clearly 
independent of the choice of axes. On applying (38) to the vector 
grad U, we get AU at the point M defined as: 


ffas 


AU = lim $. (56) 


()->M "1 


We have defined AU for the case of a scalar U. The symbol AA, 
where A is a vector field, denotes the vector with components 4 Ax, 
AA,, AA,. We now prove the following formulae: 


curl curl A = grad div A — AA, (57) 
div (fA) = fdiv A + grad f A, (57,) 
div A xB = B-curlA—A-curlB, (575) 
curl fA = grad fx A+ f curl A, 573) 


Alpy) = y åp + p åy + 2 gradọ - grad y. 
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We shall only prove the first of these, leaving the remaining proofs 
to the reader. We take the component along OX of the vector on the 
left-hand side of (57) and show that it coincides with the component 
of the vector on the right: 


0 G) 
curl, curl A = a curl, A — p curl, A = 


) (= E yx) — 3 (“2 0A, 
dz Ox ) 


= Oy oy oz ~ 


whence, on removing the brackets and adding and subtracting 67.4,/02?: 
a (aA, < 1 OA, 2! 
( a) — Cans + at + a) = 


Ox \ Ox 
=- divA—AA 
~~ “Ox x 


curl, curl A = 


which is what we wished to prove. It may be remarked here that the 
independence of AA on the choice of axes follows from (57), since 


AA = grad div A — curlcurl A. 


113. Motion of a rigid body and small deformations. We saw in [106] that, 
in the case of rotation of a rigid body about a point O, the velocity of any’ 
point was given by: 

vomoxr, 


where o is the instantaneous angular velocity vector and r is the radius vector 
OM. 

We get the most general case of motion of a rigid body on giving it in addition 
a translation of velocity vo, its total velocity being now: 


v=v +oxXr. (E8) 


Conversely, given the velocity field v, the angular velocity vector is found 
as follows. First of all we notice that the vectors v, are the same at any given 
instant at all points of the body, so that they are independent of x, y, z. Hence 
we have by (40), curl v, = 0. 

Let p, q, r be the axial components of o, and let O be the origin. The com- 
ponents of the vector product oXr will be [104]: qz — ry, ra — pz, py — qa, 
so that by (40), the components of curl o x r are 2p, 2q, 2r, whence the angular 
velocity vector is given in terms of v by 


o= aun curl v. (59) 
2 
This is the reason for curl v being alternatively called rot v, i.e. the rotation 
of the velocity vector. 
If we multiply v by the small element of time dt, we get a vector v dt which 
gives the approximate displacement of a point of the body in time dt. We thus 
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get a vector field of displacements of points of a rigid body: 
A=vdt. 


On returning to (58) and assuming that translation is absent, i.e. that the 
point O is fixed, we get the following expression for the displacement vector: 


A=o,Xr, (60) 


where o, = o dż is a small vector directed along the axis of rotation and equal 
in magnitude to the small angular displacement in time dt. Let the components 
of this vector be p,, qı, 7,, and let (x, y, z) be the coordinates of a variable point 
of the rigid body. The components of A will be: 


4,=q2—ny; Ay=ne— pz; 4,= Py — Ue. 


Hence, as above, the vector of small rotations is easily found in terms of 
the displacement vector in the form: 


o = L curl A. (61) 
Furthermore, the previous formulae show that the components of A are linear 
homogeneous functions of the coordinates (x, y, z). 

We now consider the general case of a linear homogeneous deformation 
in which the components of the displacement vector are linear homogeneous 
functions of the coordinates: 


A, = aye + biy + oez 
Ay = agt + bry + eg (62) 
A, = aze + byy + eg. 


We shall assume that coefficients a, b, and c are small, and shall confine our- 
selves to the case of a small volume (v) near the origin. Every point of the volume 
will be displaced by the vector A and its new coordinates will be: 

, E= r+ Ay n =y + Ay; =z + A,, 

i.e. 
= (1 +a) z + biy +e | 
n= ayt + (1+ bay tog (63) 
C= a,x + byy + (1 + 5) z. J 


This transformation will amount to a rotation of (v) as a rigid whole about 
O only in particular cases. In the general case, it will be accompanied by de- 
formation of the volume, i.e. with a change in the distances between its points. 
We shall explain this in more detail. 

The components of the curl of the displacement vector A are by (62): 6, — cy, 
cı — Q}, a, — b,. If the transformation reduced to rotation of the elementary 
volume as a whole, we should have a displacement vector AC) with components 


I 1 1 l 
x =- (¢, — 45) z 2 (a, — bi) y; AY) = D (4, = by) £ — =) (bs — c3) 25 


1 1 
A®) 3 (ba — ca) Y z (a az. 
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On subtracting this vector from A, we can represent the latter as 
A = A® + AQ, (64) 


where the vector of pure deformation AC?) has the components: 
I 1 
AD = aye + -y (bi +a) y+ (C1 Ha) 
AM = : 
=- 1 +a) t + bzy + -y (Ce + ba)? (65) 
I 1 
A® 2 y (a +a) z + (Ce + b) y + cz. 
It is easily seen that this latter is a potential vector, and in fact: 
I 
AQ) — -5 grad [a,x + by? + c32? + (by + a,) zy + (c1 + a3) 22 + (c3 ie ba) yz], 


whilst its curl is evidently zero. 
We now find the change in an elementary volume as a result of deformation. 
The new volume after deformation will be given by: 


v, = f f f dë dn a. 
(») 


If we carry out the change of variables in accordance with the formula of 
[60], we must write: 


didn df = {a +a) [a +b) (1 +e) _ cabs] + b, [e245 —a,(1+ AJ + 
+ c, [a,b, — (1 + b) ay]} dæ dy dz. 


On removing the brackets and keeping only the free terms and first powers 
of the small a, b, c, we get: 


d$ dy df = [1 + (a, +b: + ca)] da dy dz, 
and the above formula gives: 


Oc SA) De batt) dp dy de mee Osh bees) 


where v is the volume before deformation. The dilatation is 


vı —v 


ae = a; +b: + c3, 


whilst the sum on the right is easily seen, by (62), to be div A, i.e. the divergence 
of a displacement field gives the dilatation. 


114. Equation of continuity. Let v denote the velocity of flow of a 
fluid, and let us find the quantity of fluid flowing through a given 
surface (S) (Fig. 96), a small element of which is dS. The particles 
occupying dS at time t move by an amount v dé in time dé; thus the 
quantity dQ of fluid which passes through d9 in time dé occupies a 
cylinder of base dS and generator v dt. The height of the cylinder is 
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clearly v, dt, where v, is the projection of v on to the normal (n) to the 
surface, so that 

dQ = ov, dt dS, 
where ọ is the density of the fluid. A negative dQ will be obtained if the 
angle (n, v) is obtuse. In the case of a closed surface, (n) is the direction 
of the outward normal, and a negative dQ is obtained if fluid flows 
into the volume enclosed by the surface through dS. The total quantity 
of fluid flowing through the surface per unit time will be: 

Q = f fords, (66) 

(S) 
where fluid flowing in is understood to take the minus sign. 
The quantity of fluid occupying the volume (v) bounded by (8) is 


given by 
Dee 
(») 


and this changes in time dt by an 


amount 
de { | de diy 
(2) 


> 


so that the increment in the quantity 
per unit time is 


í o 
iE 
(v) 


whilst the quantity flowing out is given by the same integral with the 
reverse sign; hence we obtain two expressions for Q: 


o= [feras = — [ff an 
or, on using (37): i 


Q= jars ev) masa do, 


where the density ọ is kept under the divergence sign since it may be 
variable, i.e. depend on the position of the point. The last expressions 
give us an equation which is valid for any volume inside a fluid: 


JSS [a + div (ov) |dv =o. 


Fia. 96 
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It follows that the integrand must be identically zerot, and we get: 
2e 1. div (ev) =0. (67) 


This important relationship between the density and velocity of flow 
of any fluid, compressible or not, is known as the equation of conti- 
nuity. It may be written in an alternative form by considering the 
change in density of the fluid particle occupying the position (x, y, 2) 
at the instant t. 

We take o(t, x, y, z) as the fluid density at the point (x, y, z) at time 
t, and consider the change in density of a particle. As the particle 
moves, its density depends on t both directly and via (x, y, z), since its 
motion implies alteration of its coordinates. The total derivative of 9 
with respect to ¢ is: 


doe 9 do dx Nes 
‘di Ot a a he , 


which can alternatively be written as 


de 3ə 0e de 
te ee a? yt Ee 
or 
es ee tad o.v (68) 
da a TE oxy 


We can use (57,) to write (67) in the form 


a + grado-v + edivv = 0, 
i.e. by (68): 
2 4 odivv =0, (69) 


whence 
i — 1 de 
div v= — z aes 
The divergence of a velocity field v thus gives the relative change 
in density per unit time of a fluid element situated at a given point. 
If the fluid is incompressible, this change must be zero, and (69) 


gives us the condition for incompressibility: 
divv = 0. (70) 
+ We proved in [71] that, if a double integral over any domain is zero, the 


integrand must be identically zero. The same proof may be used for triple 
integrals. 
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We deduced the continuity condition by finding two different ex- 
pressions for the quantity of fluid flowing out of a volume. It is 
naturally assumed here that there are no sources in the volume, either 
positive or negative (sinks). 

If the flow is irrotational or, in other words, potential, i.e. v is the 
lamellar vector: 

v = grad p, 
we call ọ the velocity potential. We get on substituting in (70): 


Op Fp Bp 
Gat t By + Gee = O> ay 


div grady=0, ie. 


i.e. the velocity potential must satisfy Laplace’s equation (71) in the case 
of an incompressible fluid. 


115. Hydrodynamical equations for an ideal fluid. We shall understand by 
ideal fluid a deformable continuous medium such that the internal forces, 
whether the medium is in equilibrium or in motion, reduce to a normal pressure: 
so that if we distinguish a volume (v) of the fluid bounded by a surface (8), the 
action of the remainder of the fluid amounts to a force directed along the inward 
normal at every point of (S). We let p denote the magnitude of this force 
(pressure) per unit area. The pressure p(M) at any given instant gives usascalar 
field. The resultant of the pressure on the surface of (v) at a given instant gives 
us a scalar field. The resultant of the pressure in the surface of (v) is given by 


using (50): 
- ff pas = — f f f grad pan, 
(S) (2) 

where the (—) sign is taken because a positive pressure acts along the inward 
normal, the vector dS being along the outward normal by hypothesis. 

By d’Alembert’s principle, the pressure must be in equilibrium with the 
external forces, which we denote by F per unit mass and which yield the re- 
sultant for volume (v): 


Sis o Fdv, 


to which must be added the inertia force on an elementary mass — edv W, 
where ọ is the density and W is the acceleration of the fluid particle, so that the 
inertia force for (v) becomes 


—S SS eW do. 
©) 
d’Alembert’s principle thus gives us: 
f f f {oF — grad p — pW] dv = 0, 
(v) 
whence we can conclude, by the arbitrariness of (v) as above, that the integrand 


is zero, i.e. 
oW = oF — grad p. (72) 
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This expression embraces the three fundamental equations of the hydrody- 
namics of an ideal fluid. 

Let u, v, w be the components of the velocity, expressed as functions of the 
coordinates (x, y, z) of a point and time ¢. The component of the acceleration 
W along OX will be equal to the total differential of the component u(x, y, z, t) 
of the velocity with respect to time, so that we can write: 


du Ou Om , du dy . du æ 
Wise a, ae o T ay at ta Gr? 
or 
Ou ðu 0 Ju 
Mis ap ta tayt ae 
Similarly: 
Ov av Ov Ov 
W=- ta “t ayt 


Ou Ou ðu 1 ap 
dt + Ox + oy + d w= o ox 
dv dv ov Ov 1 Op = 
Bet ge gy a Oe oe oy (73) 
Ow Ow Ow Ow 1 Op 
a ga roay Oe Oe ae 


These are referred to as the hydrodynamical equations in Euler’s form. With 
them must be associated the continuity equation deduced in the previous 
article. Using the present notation, we can re-write (69) in the form: 


dg 
oy 
A characteristic feature of the equations written is the fact that we have 
chosen the coordinates (x, y, z) of a point of space and time ¢ as the independent 
variables when investigating the motion. In certain cases, the coordinates of 
the position of a fluid particle at the initial instant are chosen as independent 
variables instead of the coordinates (x,y,z) of a point; the hydrodynamical 
equations naturally have a different aspect with this choice. 


v+ 


9o de Oe Ou Ov Ow) _ 
a T On” öz w+ o(a +e tHe) =o (74) 


116. Equations of sound propagation. Equations (72) and (73) are valid for 
gases as well as for fluids in the narrow sense of the word. The essential require- 
ment is simply the hypothesis that the internal forces amount to a pressure 
only. We shall assume the motion sufficiently small for the terms containing 
products of the velocities and their derivatives with respect to the coordinates 
to be neglected on the left-hand sides of equations (73). With this, (73) become: 


Ou 1 Op w 1 Op | Ow 1 ap (75) 


w= F,—-——; =F, —-— 


a ee og Be ae eo yt Te o 82 
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or in vector form: 
Ov 1 
—=— = F —— ; 76 
g Tg 8er (76) 
Similarly, neglecting the products of the velocity components and the deriv- 
atives of the density with respect to the coordinates in (74), we get: 


de . 
ee — RIA 71 
BE +eodivv =0 (77) 
Let g, be the constant density of the medium in the state of rest. Let 8 be 


a small quantity characterizing the relative change in density during the 
motion and defined by the equation: 


e = Q (L + 8). 
From this: 
dọ ds 
oo. hee 


the small quantity s being neglected in the denominator (1 + s). It follows 
that we can take 03/0¢ = (1/@) 80/dt, and (77) gives: 


as 


<a div v. (78) 


It can be assumed that the gradient of the pressure is proportional to the gradient 
of s, characterizing the compression or expansion, i.e. 
grad p = e grad s, 


where e is the modulus of elasticity of the medium. On substituting this in 
(76) and taking 9 = 0, here, we get: 


Ov 


=F—— grads. 
Q $ 
We take the divergence of both sides of this equation: 


ð i e .. 
eo ee div grad s. 


On taking into account (78), we can write this as: 


a = a? As — div F ( = |) . (79) 
This is the equation that must be satisfied by the function s of time and of the 
coordinates of a point. The differentiation with respect to ¢ was taken outside 
the divergence sign when taking the divergence of Ov/dt, which is permissible 
since the result of differentiation is independent of the order in which it is 
carried out. 

If there are no external forces, (79) becomes: 


ahd =a? As (a = IZ i (80) 
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This latter is generally known as the wave equation. If we bear in mind that 
s characterizes the amount of compression or expansion, we can say that in our 
case the equation gives the law for sound propagation. The sound sources are 
the parts of space where div F differs from zero. 


117. Equation of thermal conduction. We saw in [108] that the 
quantity of heat passing in time dt through an elementary surface dS 
may be taken as 


dQ = kdt dS Ea = k dt dS |grad„ U(M), 


where k is the coefficient of internal thermal conduction, U is the tem- 
perature and (n) is the direction of the normal to dS. Let (8) be a 
closed surface bounding the volume (v), and let us find the total 
quantity of heat passing through (S). We easily see that: 


dQ = — dt f f k grad„ U d8. (81) 
(s) 


Here, if the temperature decreases in the direction (n) of the outward 
normal, 0U/an < 0, and the corresponding elements of the integral 
are negative, whilst the situation is reversed with increasing tempera- 
ture. Noting the (—) sign on the right of (81) and that heat flows in 
the direction of decreasing temperature, we can say that Q is the 
quantity of heat passing out of (v) in time dt. The heat flowing into (v) 
will be given by (81) with the (—) sign. 

The quantity of heat given out may be alternatively obtained 
by observing the change of temperature inside the volume. We con- 
sider the elementary volume dv; if its temperature increases by dU 
in time dé, the quantity of heat expended must be proportional to the 
temperature rise and to the mass of the element, i.e. the quantity of 
heat is 


y dU -edv = ye dt dv, 


where ọ is the density of the body and y a coefficient of proportionality 
known as the specific heat of the body. The heat given out by the 
total volume is thus 


eR | era a8 
(*) 


where the (—) sign indicates that heat given out is to be understood. 
and not heat acquired. 
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On equating the two expressions obtained for dQ and using (37) of 
[109], we have 


fff yet dv = | f | div (k grad U) dv, (82) 
() O 
i.e. we must have, for an arbitrary volume, 
Ji [ve Se — div (k grad U)|dv =0, 
whence we get k differential equation for thermal conduction: 
ye == div (kgrad U) (83) 
or 
a 0U ə 0U ) ðU 
10 Ge = ae (# Ge) + ay (4S) + ae (Fe): (Sai) 


This equation must be satisfied at all interior points of the body. The 
temperature U depends on the coordinates of the point and on time. 
If the body is homogeneous, y, ọ and k are constants, and (83) can be 


written 
U a aE 
Goes (e=) Pa 
or 
əƏU (3U ZU U 
w ~ a? | as + Oy? 022 ). (84) 


If the thermal phenomenon is stationary, i.e. the temperature is 
independent of time t and ee only on (x, y, 2), (84) takes the form 


; eu U _ 
AU = 0, ive. ual . SES Bye Faa : (85) 


We have thus arrived at Laplace’s equation, already encountered in 
[87] and [114], for the temperature in a stationary process. 

We assumed when deducing the thermal conduction equation (83), 
that no heat sources were present in the body; if this were not the case, 
we should have to replace (82) by another equation: 


ff ve dv = f f faiv (k grad U) dv + ff fedo, 
@) ©) OE 
where the last term on the right-hand side represents the quantity of 
heat originating in the volume (v) and reckoned per unit time. 

The integrand e(t, M) gives the strength of the heat sources con- 
tinuously distributed throughout (v) and can depend on time as well as 
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on the position of the point M. Instead of the differential thermal con- 
duction equation (83) we should obtain an equation of the form 


0U 


vo -a = div (kgrad U) + e (86) 

or, in the case of a homogeneous body, we should have instead of (84): 
ðU l 

a FPU sre es (87) 


Equations (87) and (84) are analogous to (79) and (80) of [116]. 
The presence of heat sources in the thermal conduction equation is 
analogous to that of external forces or, more precisely, sound sources 
div F in the propagation equations. The differential equations, (79) 
or (87), are non-homogeneous in both cases for this same reason, that 
they contain free terms, div F or e, which have to be looked on as given 
functions, in addition to the required functions s or U. An essential 
difference between equations (80) and (84) should be noticed. The 
first contains the second derivative of the required function with 
respect to time, whilst the second contains the first derivative. This 
circumstance makes an essential difference as regards the integration 
of the equations. 


118. Maxwell’s equations. The following vectors are introduced in the study 
of electromagnetic fields: the electric intensity E, the magnetic force H, the 
total current r, the electric displacement D, the magnetic induction B. The 
two basic laws of electrodynamics, which are generalizationsof the Biot-Savart 
and Faraday laws, can be written in the form l 


fa.as=—{ f mas, (88) 
i is) 


ld 
fea=—— | | Bras, (89) 
ò is) 


where c is the speed of light in vacuo. 

The first equation connects the circulation of the magnetic force round 
the contour of a surface with the flux of total current through the surface. 
The second connects the circulation of the electric intensity with the derivative 
with respect to time of the flux of magnetic induction through the surface. 
In these equations, (l) is an arbitrary closed contour bounding a surface (8). 
Furthermore, D and B are related to E and H in a quiescent homogeneous 
medium by: 

D =E; B= yH, 
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where £ and u are constants, called respectively the dielectric constant and 
the permeability of the medium. The total current is made up of two terms, the 
conduction current and the displacement current: 

r= iE +e ea 5 
where A is the coefficient of conductance of the medium. Hence (88) and (89) 
take the final form: 


1 On 
fasas f f (am +e G2) as, (90,) 
1) is) 
1 4d 
f Eds =—— -5 i uH, as. (90,) 


Ò is) 


The integrals on the left-hand sides can be transformed to surface integrals 
by Stokes’ formula: 


f curl, E dS and ffcurl, Eds, 
(5) (S) 


so that the equations now become: 


[f [e cur m (2, +255) ]as =o 
KO) 
f [fecun E +e Za] as =0., 
(S) 


We have from these, by the arbitrariness of surface (S) and therefore of the 
direction (n) of the normal: 


3E 


ccurl H= 1E + e-z > (91,) 
oH 
ce curlE = — 4 -yr ; (913) 


These latter represent the differential form of Maxwell’s equations, and amount 
to a set of six differential equations relating the six components 


Ey, Ey, By Hy, Hy, H}. 
It follows at once from (91,) and (91,) that in the present case the vector: 


OE oH 
AE + E E and Or 


are solenoidal, since their divergences are respectively 
edivcurlH and cdiv curl E 


and therefore vanish [112]. 
It can further be shown that E and H are themselves solenoidal in a given 
region of space, provided they are solenoidal there at some initial instant. 
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Before proving this, we introduce the two quantities 
div &Æ = ọ, = 0; div uH = gpp, (92) 


known respectively as the electric and magnetic charge densities. It now 
follows from the equation 


: gE) A. a. 
div (1B +2 34) = aiv cE + 5 div (eE) =0 
that 
à æ 
a he 


and integration of this first order equation gives us [4]: 


where g, is the value of 9 at ¢ = 0. Consequently, if we have 9, = 0 at an 
initial instant, i.e. 


div E, = 0, 
we shall have ọ = 0 for any t, i.e. 
div E =0. 
Similarly, it follows from (91,) that 
. OH ð 
div -5 = pr tY H =0, 


and if div H, = 0, div H = 0 for any t. 

The latter equation is equivalent to the vanishing condition for the magnetic 
charge, which is usually permissible. 

Other equations can be deduced from Maxwell’s, in which E and H appear 
separately. We take the curl of both sides of (91,) and get: 


— c curl curl E = peeun 
ot 
or, on using (57,) and (91,): 
| we) = 2(, 88 
c (SE —prad div E) =+ 2 (e E +78) 
whence finally 
OE A OE c2 : 


Exactly the same form of equation may be obtained for H. 
In the absence of electric charges, i.e. when div E = 0, (93) becomes: 
OE 4 OE c 
ae fe = AE. 94 
Ot? € Ot eu E (94) 
This is generally known as the equation of telegraphy, since it was first obtained 
when investigating the propagation of current along cables. Finally, if we are 
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concerned with a perfect dielectric, i.e. a non-conducting medium, we have 
A = 0 and (94) becomes: 


oe == a@AE ( = 7z) . (95) 


We have already encountered an equation of this form in [116]. 

If the process is stationary, i.e. vectors E and H are independent of ¢, equa- 
tion (91,) gives curl E = 0, so that E is a potential vector: E = grad g, whilst _ 
the first of equations (92) gives: 


div grad ọ = = or Ag= ae (96) 


In places where 9 = 0, i.e. electric charge is absent, we get Laplace’s equation 
Ag = 0 for the potential g. 


119. Laplace’s operator in orthogonal coordinates. We introduced 
general curvilinear coordinatesinspace in [60]. Wenow consider a partic- 
ular case of these coordinates, when the elementary volume referred 
to in [60] is a rectangular parallelepiped. This case of orthogonal 
curvilinear coordinates is the most important and the most frequently 
encountered in applications. 

Let us take three new variables g,, q>, q instead of Cartesian coordi- 
nates x, y, z in space: 


P(E YZ) = Hs P(LY,2%) = G2; © (x, Y, 2) = q3 (97) 
or, in the form when solved with respect to 2, y, z: 


£= Pi (q Fa, 93)5 Y = V (qi l2» 43); Z= 01 (gis 1z I) - (98) 


On assigning constant values A,B,C to the new variables q,, qo, 3, 
we get three families of coordinate surfaces, the equations of these in 
the x, y, z coordinates being: 


p(x,y,2)= A(X); p(x,y,z)= B(II); w(x, y,z)= C (III). (99) 


We take any two coordinate surfaces from different families, say from 
families (II) and (III); these will intersect in a curve, with the equation 


p(z, y,z) = B; w(x,y,2) =C, 


where B and C are definite constants. Only q, varies along this curve, 
and we can refer to it as the coordinate line q,. Similarly, we have 
coordinate lines q, and qs. 
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We find the square of an element of length in the new coordinates: 


ds? = da? + dy? + dz? = (ge dg, + 3% se dg + get Op. Pdga) + 


1 1 d 2 3 i 
+ (Goda, + Ge dgs + 5 das} Ea + pt dae + Get dg): (100) 


On removing the brackets, we get a homogeneous second degree poly- 
nomial in dq,, dg, dg}. Next we find the conditions for the poly- 
nomial not to contain product terms in different differentials dq. 

We can take, say, the term in the product dq, dg, in (100). The 
coefficient of the product will be: 


2 (fm. oe Ov. , Ov a (101) 


ag dqa | Og, Og, ' Og Oga) 


An elementary volume in the new coordinates (Fig. 97) will be 

bounded by three pieces of coordinate surfaces. Three edges, AB, AO, 

AD, will leave the vertex A of its 

base, corresponding to the values 

1 aa qs q% qz of the new coordinates. 

Only q, varies along AB, only q 

along AC, and only q, along AD, 

If we consider the first and second 

edges, functions (98) are functions 

of q, only on the first edge, and the 

A, dq, B direction-cosines of the tangent are 
proportional to [I, 160] 


H3dq3 


Oy, = Oy, 3o 
ðq? q? Og, ` 


Similarly, the direction-cosines of the tangent to the second edge 
are proportional to: 


bp ay 2o 

09,” Og’ gz ` 
The vanishing of (101) is thus equivalent to these two edges being 
perpendicular. If we also require the vanishing of the coefficients of 
dq, dg, and dg, dg, in (100), this is equivalent to requiring that all three 
edges of the elementary volume in the new coordinates should be 
mutually perpendicular. Hence, the necessary and sufficient condition 
or a system of curvilinear coordinates to be orthogonal is that the expression 
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for ds? should contain only terms in the squares of the differentials, i.e. 
only terms in dqj, gz, and dg. 
We shall assume in future that the curvilinear coordinates are 


orthogonal. 
We now have for ds? an expression of the form 
ds? = Hi dg? + H3 dg + H3 dgh, (102) 
where 
n= (a). + (ap) + (a) 
m = (on) + (an) + (aa) (108) 
m= (sn) + Cee) + Cag) 


If we recall that only one variable changes along each of the edges 
of an elementary volume, we obtain for the lengths of the edges, 
by (102): 

ds, = H,dq; ds, = H,dg,; dss = Hs dgs, (104) 


and an elementary volume in the new coordinates is given by the 
expression 
dv = ds, ds, ds, = H, H, H; dq, dq, dq. (105) 


Now let A be a vector field in space; we know from [109] that its 
divergence at a point M is given by 


Sf Anas 
divA= lim & __, 
@)om ^ 

where (S,) is the surface bounding a volume (v,) of magnitude V, 
which contains the point M and contracts indefinitely towards it. We 
apply this to an elementary volume in curvilinear coordinates q,, 92, q3, 
and find the flux of the field through the surface of the volume. We 
start by finding the flux through the right and left-hand faces. The 
coordinates have the values q, g>, qa at the base vertex A, whilst q, 
will have to be replaced by (q, + dq,) on the right-hand face. Further- 
more, the direction of the outward normal on the right-hand face 
coincides with the direction of the coordinate line q}, the direction on 
the left-hand face being the reverse. The component A, along the out- 
ward normal (n) is thus Aq, on the right-hand face and (— Aq,) on the 
left-hand face, where Ag, is the projection of vector A on the tangent 
to the coordinate line q, or, as we usually say, on the coordinate line q,. 
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In view of the smallness of the faces, we can replace the surface inte- 
grals SfAn dS over them by the simple product of the integrand and the 
area of the corresponding face, which gives us for the flux through 
the right and left-hand faces: 
Aq, As, d83 | q,+ag, and — Ag, ds, ds3 lq, 
the flux through both faces being 
Aq, ds, dss | 9, +49, — Ag, ds, dss | ¢, 
or, by (104): 
Aq, H, H; dg, dgs | q,+ dq, Ag, H, H, dq, dgs | = 
= [H, H; Aq, | a,+40, — Ho Hg Aq, | 0,] 492 dg- 
We finally replace the increment of the functions by the differential, 
and get for the flux through the right and left-hand faces: 


(H, H, A 
sere Ti a) dg, dg, dgs. 


Similarly, the flux through the front and rear faces is 


0H, H, A 
aa a 2). dg, dg, dg 


and through the upper and lower faces: 


3H, H, A 
AE et dg, dg, dgs. 


We add these three expressions and divide by the volume obtained 
from (105) and arrive at an expression for the divergence of the field 
in orthogonal curvilinear coordinates: 


div A= 


1 (H, H, An) (H, H, Aq) (H, H, Aqa) 106 
HEE; l Og cs wA + ZA | 1198) 
Now let A be a potential field, i.e. the field of the gradient of a 
function U(M), so that A = U. 
The component A,, is now the derivative of U with respect 
to the direction q,: 


and similarly: 
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Substitution of these expressions in (106) gives us the expression for 
Laplace’s operator in curvilinear orthogonal coordinates: 


k l 0 (H,H, 3U 
So et ac (Cee E 
U 1V gra H, H, H, 3q, H, ðq, $ 
fa] H, H, ƏU ) f] ( H, H, 0U )] 
a . 107 
A H, dq A H; 095 ( ) 


Laplace’s equation AU = 0 takes the form, in coordinates q,, q>, q3: 


to) (= £) a (57 | 
aqi H, aqi T ag, H, Og, a 


} 
I 


ə (=a. ai (108) 


ðq L Hz, 095 
1. Spherical coordinates. In this case, expressions (98) become [59]: 
x=rsin@ cosy; y=rsinOsing; z= rcos®ð, 
where q; = 7, qa = 9 and q, = p. We find ds?: 
ds? = (sin 6 cos g dr + r cos 8 cos p d0 — r sin 0 sin p dø)? + 
+ (sin 6 sin y dr +- r cos 6 sin g dé + r sin 6 cosy dọ)? + 
+ (cos 0 dr — r sin 6 d0}? , 
or, on removing the brackets: 

ds? = dr? + r? d0? + 7? sin? 6 dg?, (109) 


ie. H, = 1, H, =r, H, = r sin 6, with 0 < 0 < a, so that H, > 0. 
Substitution in (108) gives us Laplace’s equation in spherical co- 
ordinates: 


8 Lon a 3U ə f. n 3U ap il avy) _ 
=" sin 0 -g ae (sin o Er- ET y=? 


or 


ə aU 1 Ə f. 0U 1 U i 
G or }+ sin 6 06 (sino 30 )+ sin?6 Og? == 0 hl) 


We find the solution of this equation depending only on the radius 
vector. With this, we have to take 3U/30 = 0U/dp = 0, so that 


“Or. or 
whence 
o 3U =. ww a 
ee =—C, or ge ae 
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and we find on integration: 


Cı 
r 


U = 


+ 0,, (111) 


where C, and C, are arbitrary constants. We recall that 7 is the distance 
of the variable point M to the fixed point M,, which we can choose 
as the origin. In particular, with C, = 1 and C, = 0, we have the 
solution l/r, already mentioned in [87]. 

2. Cylindrical coordinates. Here, 


L=Ecosp;, Y=esng; 2=2, 


so that g, = 0, q = p, q = z. We have for d2: 
ds? = de? + o* dg? + dz?, 


whence H, = 1, H, = 0, H} = 1, and Laplace’s equation becomes in 
cylindrical coordinates, by (108): 


wlew) + oe Ce ae) + ae (eae) =O 


ə au 1 U 2U 
æ le de J o p? Teg Eh (112) 


or 


It is easily shown, as above, that the solution depending only on 
the distance o from OZ is: 


U = 0, loge + Or, (113) 


Let U be independent ofz, i.e. U has the same value at corresponding 
points of all the planes parallel to the XY plane. It is now sufficient 
to consider the values of U on the XY plane only (the plane case). 
Laplace’s equation in rectilinear rectangular coordinates gives here: 


eU aU 


gat F Oy? =O 


If the plane is referred to polar coordinates (0,p), we get the equation, 


pean ə 3U eu 
1 2 
w (2 pears ape 7O- 


It is clear from (113) that in the plane case a solution of Laplace’s 
equation will be given by log e, where ọ is the distance of a variable 
point from a fixed point. Of course we could take the solution log 1/e = 
= —log ọ instead of log 9. The fundamental solution of Laplace’s 
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equation in three-dimensional space is thus the inverse of the distance 
of a variable point from a fixed point, whilst it becomes the logarithm 
of this inverse distance, or of the distance itself, in the plane case. 


120. Differentiation in the case of a variable field. Let a scalar field U(t, M) 
and a vector field A(t, M) be given in space, the field varying with time in 
both cases, i.e. the value of the scalar or vector is a function of time at every 
point. Further, let all space be given a motion represented by the velocity 
field v. We shall suppose this latter vector to be also dependent on time ż. 

We investigate the variation of U with time. We can do this in two ways: 

1. We can fix our attention on a definite point of space and find the rate of 
change of U at this point. We arrive at the partial derivative 0U/0t, which 
may be called the local derivative, since it relates to a definite position in space. 

2. We can find the rate of change of U whilst fixing our attention on a 
definite particle of the moving medium (substance). When differentiating with 
respect to time, we now have to take into account the motion of the points of 
the medium themselves, i.e. we have to differentiate U both directly with 
respect to é and indirectly via the coordinates (x,y,z) of the point M. 
We arrive in this case at the total, or as it is sometimes called, the particle 
derivative : 


dU eu , ƏV de , UU dy , ðU dz _ 
d ~= a o@ “de ' oy dt de “de 


0U ðU ðU 3U 
sra toa a tt ae e 


which may be written in the condensed form: 


dU 0U 
-i = g tyr grad U. (114) 


We had an example of a particle derivative in [114], where we considered 
the total derivative with respect to time of the density of a particle of a con- 
tinuous medium in motion. 

Similarly, we have for the variable vector A(t, M) in a moving medium: 


dA _ ðA 3A A 3A 
a ae tae t ty t e 
or 
dA _ ðA 
-ir =g t (werad) A, (115) 


where the meaning of the symbol (v grad) is given by: 


3 ð a 
(v grad) eee Hey Bt te Be . 


The first term in (114) or (115), i.e. the partial derivative with respect to 
time, represents the change at a given place, whilst the second term represents 
the result of the motion of the medium itself. 
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We now establish certain formulae for differentiating integrals over domains 
related to moving media. The dependence of the integral on time is here due 
both to the integrand being a function of time and to the domain of integration 
varying with time. We can find the derivative with respect to time ¢ by looking 
on the two-fold dependence on ¢ as a dependence on two variables then applying 
the rule for differentiating functions of a function [I, 69]. It amounts in essence 
to a case of the principle of superposition of indefinitely small operations. 

The derivative of the integral with respect to ¢ will consist of two terms: 
the first is found by assuming that the domain of integration is fixed and is 
given by simple differentiation with respect to ¢ under the integral sign [80], 
whilst the second only takes into account the change in the domain of integra- 
tion, the integrand being meantime assumed constant. 

We consider various cases. 

1. Let (v) be a variable volume and U(t, M) a scalar function. We find the 


expression for the derivative: 
ar 
T f f U dv. 


(v) 


Every element dS of the surface (S) bounding (v) describes a volume dt v, dS 
in time dt, where (n) is the direction of the outward normal to (8) [114]. 

On multiplying this change in volume by the corresponding value of the 
integrand U and summing over the total surface (S), we get the change in the 
value of the integral due to the change in the domain (v) itselff: 


dt f f Ue, ds. 
(S) 


On dividing by dż and adding the term due to the change in the integrand, 
we get the e for the derivative of the integral in the form 


aljosa EJ ee dS, 


whence, on applying Ostrogradskii’s formula, we have: 


T aves 5 +div (Uv)| ae. (116) 


On expressing 0U/dt in terms of dU/dt in accordance with (114) and using 


(57,) [112], 
div (Uv) = U div v + v -grad Ọ, 


we can re-write (116) as 


dt y Sfo- Sla + U divy | dv. (117) 


() 


{ If the angle between the direction of v and (n) is obtuse, v, will be negative 
and the change in volume dł v, dS will also be negative. 
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2. We now consider the derivative of the flux of a variable vector field 
A(t, M) through a moving surface (S): 


d 
| f Anas. 
(S 


Here, (S) is a surface related to a moving medium and (n) is the direction of 
the normal to (S). One term in the required ex- 
pression will be 5 
dAn (n) 
f a oS: (118) 
(S) 
The second term, due to the movement of (S) 
itself, is found as follows. Let (l) be the contour 
of the surface and ds a directed element of the 
contour; we define the direction of (l) later (Fig. 
98). Surface (S) describes a volume (dV) in time 
dt, the three boundary surfaces being: the position 
(S;) of (S) at time t, the position (Sat) at time 
t + dt, and the surface (S’) described by (l) in 
the interval dé. An elementary area of (8°) will be b 


S 
dS’ = |ds x v|dt. A d 
Let (n) be the direction of the normal to (S;) Fre. 98 


and (S;,q:), taken in the same direction, and let 

the normal to (S;,4;) be directed outwards from 

(ôV). Let (n) also denote the direction of the outward normal from (ôV) to 
(S’), and let the direction of (l) be specified such that ds, v and (n) to (S°) 
have the same orientation as the axes. With this, clearly, 


A, dS’ = A. (ds x v) dé, 
so that Ostrogradskii’s formula gives us: 
ff Ands — $ f Ands + dtf A. (dsxv) = f f f div Ado. (119) 
(Siar) (Sì) ® (6V) 


The (—) sign in front of the integral over (S,) is due to the normal (n) over(S,) 
being directed inwards to (6V). But we know from [105] that 


A+ (ds xv) = ds- (vx A) = (vx A), ds, 


where (v X A), is the projection of v x A on the direction ds, and hence, by Stokes’ 
formula: 
J A+(dsxv) =f (vx A), ds = f f curl, (vx A) dS. 
0) (1) (S) 
On dividing (ôV) into elementary volumes dv = v, dS dt, where dS is an 
element of (S;), we get by (119): 


Sf AnaS — f f An dS = dt f [v div A—curl, (vx A)] ds. 
(Styaz) (S) (S) 
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We divide both sides by dt and pass to the limit to obtain the term of the 
derivative due to motion of (S). We add term (118) and get finally: 


gff Aa dS = y) [P + vu div A + curl, (axe) ds - (120) 

(S) 
The term in curl, (A x v) is absent if (S) is a closed surface, and the formula 
in this case follows at once from (116). Suppose, in fact, that (v) is a variable 


volume bounded by the closed surface (S); we obtain, on applying Ostrogradskii’s 
formula and (116): 


d ar} | 4s ndS = -S Aaa e div (v div A)| av = 
AE ravaje- Jj no, div A) as. 


3. We now find the derivative of the circulation of a variable vector over a 
moving curve: 
d 
al A s ds. 


(0) 
As usual, one term in the required expression will be 


ES ds. (121) 
Ù 


The additional term, due to the motion of the curve itself, is obtained as 
follows. The curve (l) describes a surface (ôS) in time dé, the four boundary 
lines being (Fig. 99): curve A,A,, which is the position (J,) of (1) at time ¢; curve 
B;B,, the position (l;,4:) of (2) at time ¢ + dt; and finally, the curves A,B, 
and A,B,, described by the ends A, and A, of (l) in time dt. Stokes’ formula gives: 


f A,ds+ f A,ds— f A,ds+ f A,ds=ffourl,AdS, (122) 
ti) (AB) (usar) (BA) (68) 


the integrations over (l) and (liat) being taken from A, to B, and A, to B, 
respectively, whilst (n) is the normal direction to (ôS) such that the vectors 
ds, v and (n) on (l) have the same orientation as the axes. The integrals over 
the small curves (A,B,) and (B,A,) can be replaced by single elements, i.e. 
the products of integrand and length of path; we have for these the scalar products 
of A and the small displacements v dé: 


A@).y@ dt and — A). xD de, 
where the (—) sign is used because integration over curve B,A, is carried out 
from B, to A,, i.e. in the opposite direction to v, whilst the superscripts denote 


values corresponding to the ends A, and A,. 
An elementary area dS is given by 


dS = |ds xv | dt, 


=- 
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and the normal (n) to the surface has the same direction as the vector ds X v, 
so that clearly: 


curl, A dS = (ds X v) - curl A dt = (v x curl A)» ds dt, 
and (122) gives: 
§ Agds—f A, ds = A®). v2) dt — AQ). yO) dt + dt f (curl A Xv) ds. 
(i) (4) 


(heat) 


On dividing both sides by dé, passing to the limit and adding term (121), 
we get the required derivative, where we write simply (J) instead of (l): 


T [As ds = A@). y(2)— AQ). y) + IE + (curl A xv)s]| ds. (123) 
® (i) 


Fie. 99 


If (1) is a closed curve, the terms outside the integral fall out, and we get: 


d OAs 
S | As ds = | [ ae (curt Ax v)s |ds. (124) 
(Q) 0) 
This expression can be obtained more simply by transforming the line 
integral by means of Stokes’ formula, then using (120). 


We further consider the circulation of the velocity along the moving contour 
(l). By (123): 


pf tds = vO. ve — v9. vo + f [EE H (curl vx 2 ee 
0) (9 


= {v®) j — fy) |2 + f [ z + (curl v xv) ds. (125) 
(OS 
The component of curl vx v along OX will be: 


E av, _ Ov, ar avy Z vy 
(curl v X v) = ( Oz Ox ) Yz ( Ox “Oy Jo 
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On removing the brackets and adding and subtracting v,.0v,/@x, we can write: 
_ vy Ov, avy, 
(curl v Xv), = On wta oTr 
Ov, vy Ov, 
—( Ox Px T Ox ty + Ox o), 
and hence we easily obtain, using (115): 
dv ðv 1 ðv 1 
a eee a 2— ee SS E 
curly Xv = a at 5 grad |v|? =w aE 5 grad |v|?, 


where w is the acceleration vector. We have on substituting in (125): 


+ f v de = fv |? — fr je + f (ws — 5 grad, |v) ds = 
ù À 
(126) 


= > [v2 — fy) 2] + fu ds, 
() 


since clearly: 
§ grad, |v]? de = |v® |2 — |wj2, 
® 


CHAPTER V 


FOUNDATIONS 
OF DIFFERENTIAL GEOMETRY 


§ 12. Curves on a plane and in space 


121. The curvature of a plane curve; the evolute. The present chapter 
deals with the basic theory of curves and surfaces; it starts with 
plane curves, after which we pass to curves in space and surfaces. 
Our account makes use of vectors, so that the reader must carefully bear 
in mind the initial sections of the previous chapter up to and including 
[107], which deals with the differentiation of vectors. We start by 
proving a lemma: 

LEMMA. If A is a vector of unit length (a unit vector) which depends 
on a scalar parameter t, we have dA/dt- A = 0, i.e. dA/dé | A. In 
fact, A+ A = 1 by hypothesis, and we obtain on differentiating this 
equation with respect to t: 


dA dA 
ar AtA O 
or, by the independence of a scalar product on the order of its factors: 


na -A=0, ie. oA LA, 
where the orthogonality condition obviously only has a meaning in 
the case when dA/dt differs from zero. 

Here and in future, we always assume the existence and continuity 
of the derivatives mentioned in the text. 

Let (Z) be a plane curve and let the position of a variable point M 
on the curve be described by a scalar parameter ¢. The curve may be 
characterized by the radius vector r(t) from some fixed point O to a 
variable point of the curve (Fig. 100). The derivative dr/dt gives a 
vector directed along the tangent to the curve, as we saw in [107}, 
whilst if the length of arc s, measured from some definite point of the 
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curve in a definite direction, is taken as the parameter, dr/ds is the 
unit tangential vector t, the direction of which coincides with the direc- 
tion of increase of parameter s along the curve: 


dr 
ds 


=t. (1) 


The derivative of the unit tangential vector with respect to s is called 


the curvature vector: 


dt 


t(s) 


t(stAs)~t(s) 


tlt) 


{t) 
Fig. 100 Fie. 101 


The length of this vector characterizes the rapidity of change in the 
direction of the vector t, and is called the curvature. 

The curvature vector is perpendicular to the tangent by the lemma 
just proved, i.e. is directed along the normal. 

Furthermore, it follows at once from the definition that it is directed 
towards the concave side of the curve, since the difference t(s + As) — 
— t(s), with As > 0, is directed towards this side (Fig. 101). 

The length of vector N is called the curvature, as already mentioned, 
and if we introduce the notation 


N => (3) 


the inverse of the curvature, o, is called the radius of curvature. We 
now write n as the unit curvature vector, i.e. the vector of unit length 
with the same direction as N. 

If the length | N | = 0, we must take ọ = œ, whilst n is not defined. 
For instance, if (Z) is a straight line, |N] = 0 at every point of it, 
and we can choose either of the two directions for the normal lying in 
the plane concerned. We shall assume in future that | N | # 0. 
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We have by (3): 
1 

N= > (4) 
Let us mark off along the direction of n, i.e. along the normal direction 
towards the concave side, the segment MC, equal to the radius of 
curvature at the point M of the curve (Fig. 102). Its end C is called 
the centre of curvature of the curve at the point M. If M moves 
along the curve (L), O varies and describes a curve (L), called the 
evolute of curve (L), i.e. the evolute of 
a curve is defined as the locus of its 

centre of curvature. 

We must find the derivative 
du/ds for the sake of what follows. 
Since n is a unit vector, dn/ds | n 
i.e. dn/ds is directed along the tan- 
gent. We obtain on differentiating 
the obvious equation t -n = 0 with 
respect to s: 


But N and nhave the same direc- 
tion, and N- n = l/ọ by (4), so 
that it follows from the above equa- 
tion that t + dn/ds=—1/o. We see 
from the fact that t and dn/ds are parallel that dn/ds is in the opposite 
direction to t and has a length 1/9, i.e. 


Fie. 102 


dn 1 
ds rg i: (5) 

Let r be the radius vector and s the length of arc of the curve (L) 
as above, and let r, and s, be the corresponding magnitudes for the 
evolute (L). On differentiating with respect to s the equation (Fig. 102) 


r =r+on 
we get 
dr, de dn 
ds Ea ds Ber ds ’ 
or, by (5): 
dr, de dr, de 
dg gg ee Hoe Sage ag (6) 
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The right-hand side of this equation is a vector directed along the 
normal to (L), whilst the left-hand side is a vector directed along the 
tangent to the evolute; hence the normal to (L) is parallel to the tan- 
gent to the evolute. But both these latter past through the same point 
C and are thus identical, leading us to the first property of evolutes: 
a normal to a curve is tangential to the evolute at its corresponding point. 

On recalling the definition of envelope, we can state a second pro- 
perty of evolutes: the evolute is the envelope of normals to the curve. 

It seems natural to take the length of arc s, as the parameter for 
the evolute, and by the rule for differentiation of a function of a 
function: 

dr, dr, ds, ds, t 
ds — ds, ds ds ? 


where t, is the unit tangential vector to the evolute. We get on substi- 
tuting in (6): 
ds, t de 


dash de ™ 


and we see by comparing the lengths of the vectors on the two sides 
of the equation that 


ds, | __ 
ds 


de 


~ | ds 


» ie. |ds,| = |do]. 


If we assume for simplicity that ọ and s, are increasing over the. 
sections of the curve and evolute concerned, we can write ds, = do.t 
Integration of this relationship over the section concerned shows that 
the increment in length of arc of the evolute is the same as the incre- 
ment in the radius of curvature of the initial curve. Hence we have a 
third property of evolutes: the increment in the radius of curvature of 
the curve over a section of monotonic change is equal to the length of arc 
between the corresponding points of the evolute. In the case of Fig. 102, 
this property is expressed by: M,C, — MC = V CC 

We take definite axes OX, OY on the plane and let ọ be the angle 
that the tangential direction t forms with OX. We express the unit 
vector in terms of its components: 


t = cos gi -+ sin gj, 


t On the assumption that ọ is varying monotonically, we can always choose 
the direction for measuring s, such that s, is increasing monotonically with e@. 
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where i and j are the unit vectors along OX, OY. We obtain on dif- 
ferentiating with respect to s: 


A dy . dg . 
N= — sing 2 i+ cosp- j, 
whence the square of the length of the curvature vector is given by: 


1 
“ge 


= (—sing 2)’ + (cosp F or = 


EA 
ds 


This is the same expression for the curvature as obtained in [I, 71]. 
Let the equation of (L) be given explicitly as 


y = f(x). (7) 


The equation of the family of normals to the curve will be 


Y—y=—~-(X—2) or (X—z)+¥(¥—y)=0. (8) 


Here, (X, Y) are the current coordinates of the normal, (x, y) are 
the coordinates of the point M of (L), and y is the function (7) of xv. 
The role of parameter in equation (8) of the family of normals is played 
by the abscissa x of a variable point of the curve. On applying to (8) 
the usual rule for finding the envelope [10], two equations have to be 
written: (8), and a new equation obtained by differentiating (8) with 
respect to the parameter x: 


Fag ne ee 
—1l+y' (Y—y—y?=0. 


Elimination of x from these equations gives an equation connecting 
X and Y. This is in fact the equation of the normal envelope, i.e. the 
equation of the evolute. An alternative procedure is to solve system 
(9) for X and Y in terms of the parameter v, so as to obtain the para- 
metric equations of the evolute: 


(9) 


X= LCH, yay, (10) 


If the equation of (L) is itself given parametrically, the derivatives 
of y with respect to x in (10) have to be expressed in terms of the 
differentials of the variables [I, 74]: 


a(34) 
,_ dy , ra dz} __ d?y dx — d?r dy 
Ya? Y 5i T dg? i 
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Substitution of these expressions in (10) gives us the parametric 
equations of the evolute in this case as: 


eo dy(da? + dy?) | yy 2: da(daz? + dy?) 


d?yda—d’ady ’ d?y dx — d?e dy (11) 


Examples. 1. We find the evolute of the ellipse: 
2 
—+4 = (a >b). 


On writing the equation of the 
ellipse in the parametric form 


x =acost; y=bsint 


0 


Frc. 103 Fie. 104 


and substituting in (11), we find after simplification: 


a? — b2 
X =——— cost, 
a 


2 p2 
yee sin3t. 


We eliminate parameter ¢ from these two equations. We multiply the first 
by a and the second by b, raise to the power 2/3 and add; this gives us the evolute 
of the ellipse in the implicit form: 

2 2 Bint 2 
as x3 + b3 Y? = (a? — b?)?. 

The evolute is easily plotted by using these equations. The radius of curvature 
attains its least and greatest values at the vertices of the ellipse and the cor- 
responding points of the evolute are singular points (cusps) (Fig. 103). 
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2. We find the evolute of the parabola y = az®. We easily find, on using 
equations (10): 


1 
= — 4a2x3 D 2 
X = — 4a’x3, Y zg T 302. 
Elimination of the parameter z now gives us the evolute of the parabola 
as explicitly (Fig. 104): 


l 3 2 
Y = — —— X3 
2a à 2 V2a 


3. We take the cycloid: 


x=a(t—sint); y=a(l—cost). 


y 


Fic. 105 


We use equations (11) to find the parametric equations of the evolute as 
X =a (t +sint); Y =—a (l — cost). 


This may easily be shown to be the same curve as the original cycloid, 
except for being displaced relative to the axes (Fig. 105). We simply put t = 
= Tt — 2, and write the last equations as: 


X + an =a (t — sin t), 
Y + 2a =a (1 — cost), 


whence our statement follows immediately. 


122. Involutes. The curve (L) itself is called the involute of its 
evolute (Z,). A rule for drawing the involute of a given evolute is 
easily derived from the properties of evolutes. If C is a variable point 
of (L) and s is its length of arc, we cut off a section OM = s + a, 
where a is a constant, in the negative direction along the tangent to 
(Lı) at C, and consider the locus (L) of the end M. This locus may 
easily be shown to be the required involute (Fig. 106). It is sufficient 
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to show that CM is normal to the curve (L). Let r and r, be the radius 
vectors to (L) and (L), as above, and let t, be the unit tangential 
vector to (£,). By construction: 


r=r,—(s,+a@)t, 


b whence differentiation with res- 
pect to s, gives: 


dr dt 
T 7 t, —t, —(s, + 4) ae: , 

: dr dt, 

1.e. ‘dg ee) ds, . 


It is clear from this that the vector 
dr/ds,, parallel to the tangent to 
Fra. 106 (L), is at the same time parallel to 
the vector dt,/ds,,i.e. to the nor- 
mal to (L); hence it follows that the tangent CM to (L) is normal to (L). 
We can assign arbitrary values to the constant a in the expression 
OM = s, + a, so that an infinite number of involutes can be obtained 
for a given evolute. It follows from the method of construction that 
any two involutes have a common normal, the intercept of which 
between the curves maintains a constant length equal to the difference 
between the values of the constant a taken. Two such curves are said 
to be parallel. 


123. The natural equation} of a curve. The curvature along any 
curve is a definite function of the length of arc: 


= Ke). (12) 


We show that, conversely, a single curve is defined by any equation 
of the type (12). We fix the direction of the x axis arbitrarily and let 
p be the angle between the tangent to the curve and this axis. We 
know that 1/@ = + dy/ds, and (12) gives: 


SE = No), 


whence 
S 


p= f Hs)ds +0. 


0 


+The natural equation is also known as the intrinsic equation. 
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We may assume that the direction of the x axis coincides with the 
direction of the tangent at s = 0, so that we may take C = 0 in the 
last equation, i.e. we obtain as an expression for the angle g: 


S 
g = -+ F(s), where F(s) = f f(s) ds. 
0 
We know moreover [I, 70] that 


dr Saas 
da 7 COSY; 
whence, by the previous equation: 


g= f cos [F(s)] ds + Cj, 
y= + f sin [F(s)] ds + C. 
9 


If we locate the origin at the point of the curve for which s = 0, we 
must take C, = C, = 0, and we get the fully defined curve 


x= f cos[F(s)] ds; y= + f sin [ F(s)] ds. (12) 
? ò 


The + sign merely gives symmetry with respect to the z axis. 

We have now shown that (12) can correspond to a curve defined 
in the above sense and that equations (12,) must specify the curve 
parametrically with the coordinate system chosen. It may easily be 
verified that the curvature of the curve defined by (12,) in fact has 
the value given by (12). 

We speak of (12) as the natural equation of the curve, in the sense 
that it is independent of the particular coordinate system chosen, 
whilst a fully defined curve corresponds to it (except for symmetry). 

Examples. 1. If (12) has the form 1/@ = C, i.e. the radius of curvature is 
constant, we know that a circle satisfies the equation [I, 71]. It follows from 


the above that a circle is the only curve with constant radius of curvature. 
2. Let the curvature 1/9 be proportional to the length of arc: 


1 


— = 2as, 
Q 


where 2a is the positive coefficient of proportionality. The above working gives 
in this case: 


S 


S 
z = Í cos (as?) ds; y = § sin (as?) ds. (13) 
0 0 


358 FOUNDATIONS OF DIFFERENTIAL GEOMETRY {124 


We can say from the convergence of the integrals [83]: 
f cos (as?) ds; f sin (as?) ds 
0 0 


that on indefinite increase of s the curve will tend to a point of the plane with 
coordinates equal to the values of these integrals, the point being approached 
by spiralling about it (Fig. 107). If s is assigned negative values in expressions 
(13), we get the portion of the curve contained in the third quadrant of the 
axes. The curve obtained here is known as a Cornu spiral and is encountered 
in optics. 


Fia. 107 Fie. 108 


124, The fundamental elements of curves in space. A curve (L) in 
space can be defined by specifying the radius vector r(t) from the origin 
to a variable point M of the curve (Fig. 108). If we take the length 
of arc s as the parameter ¢ and differentiate r with respect to s, we get 
the unit tangential vector to the curve [107]: 


dr 
j 7t (14) 


The derivative of t with respect to s is known as the curvature vector: 
dt 
and the length of this defines the curvature 1/ọ, the inverse g being 
called the radius of curvature. The vector N is perpendicular to t, as 
in the case of a plane curve, and the direction of N is known as the 
direction of the principal normal to the curve. If n denotes the unit 
vector in the direction of the principal normal, we can write: 


N= n. (16) 
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Now let b be the unit vector perpendicular to t and n: 
b=tXn. (17) 


This is known as the unit binormal vector. 

The three unit vectors t, n and b, having the same orientation as 
the coordinate axes, are said to form a variable triad related to the curve 
(L). If (L) is a plane curve, vectors t and n lie in the plane and the unit 
binormal vector b is therefore a constant vector of unit length perpen- 
dicular to the plane. In the case of a non-plane curve, the derivative 
db/ds characterizes the deviation of the curve from the plane form 
and is called the torsional vector. We show that the torsional vector is 
parallel to the principal normal. From (17): 

db dn 


F =Nxn ttx -y 3 


But N and n have the same direction so that their vector product 
is zero, i.e. 


= =tx —_, (18) 


whence it follows that db/ds and t are perpendicular. We know on the 
other hand that db/ds, the derivative of a unit vector, must be perpen- 
dicular to b itself. Hence db/ds, erpendicular to t and b, must in fact 
be parallel to n, and we can write 
$n, (19) 

where the numerical coefficient 1/1 is called the torsion of the curve, 
the inverse t being the radius of torsion or radius of second curvature. 
Unlike the curvature 1/9, l/t can be either positive or negative. Need- 
less to say, the existence of the tangential, curvature and torsional 
vectors is bound up with the existence of the derivatives in terms of 
which they are expressed. 

We now obtain formulae for calculating the curvature and torsion. 
On taking Cartesian axes OX, OY, OZ, with the corresponding unit 
vectors i, j, k, we can write: 


t=aityt+zk; t= a qd tas k; 


_ @az., dy. , dz 
N= ds? L de J t ap k, 
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whence we obtain for the length of the vector N: 
1 (dy d2y \2 d2z \2 
eo har) ae ae) (20) 


It follows from (19) that the torsion 1/t can be expressed as a 
scalar product: 


l1 db 
“tds 
or, by (18), ; P 
n 
— = (e x a) n 
On substituting for n from (16) 
n= oN, 
we get: 
1 d(oN) de dN 
= (tx ds ) N [ex (a + a )] Ns 
a dN 
= 9-52 (ExN)-N + o- (tx I ) N. 


The vector product t x N is perpendicular to N, so that the first term 
in the last expression is zero; hence 


Lae) a 


or, on transposing the factors in the vector product: 


T 


=e (2 xt] N. 


We get finally, on carrying out a cyclic change of the vectors and 
using (14) and (15): 


l dr d?r dèr 
( ds ds? ) 


—=—¢ er Fe (21) 


T 


It may be noted that the coefficient of (— ọ°) is the volume of the 
parallelepiped formed by the vectors dr/ds, d?r/ds?, dr/ds* [105]. 

We return to expression (20) for the curvature. It is assumed in this 
that the coordinates x, y, z are given as functions of the length of arc. 
We now obtain a new form of (20), suitable for a curve given in any 
parametric form. We shall need to express the derivative of the coor- 
dinate with respect to the length of arc in terms of the differential of 
the coordinate. Differentiation of the expression 


ds? = da? + dy? + dz? (22) 
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gives us: 
ds d?s = da d?a +dy d2y + dzd2z. (23) 
We have, moreover [I, 74]: 
de _  dwds—d’sde | dèy  dtyds—dtsdy , 
ds? ds3 » de? — ds? d 
d?z d?z ds — d?s dz 
ds? des? a 28) 


We obtain on making these substitutions in (20): 
1 
“oF = 


_ da*{ (d#x)?+ (d*y)?-+ (d#z)*]_ 2ded*s(dad2x-++-dyd*y+dzd*z)-+ (d?s)?(da*-+dy?-dz?) 
= dsé 


or, by (22) and (23): 
1 
= 


__ (da?-+dy?+-dz*)[ (d#a)?+4+-d?y)?-+ (d?z)?]_ (da d?x+dy d?y+-dz d? z)? (25) 
= da i 


We now recall from [104] an elementary algebraic identity that we 
shall need: 


(a? + b? + c?) (a +b} + of) — (aa, + bb, + 004)? = 
= (be, — cb,)? + (ca, — ac,)? + (ab, — ba, )?. (26) 
On applying this identity to the numerator of the right-hand side 


of (25), we can write the square of the curvature in the final form: 


1 A?-4+ Bt408 
OF Gat aye day? (27) 


where 
A = dy d?z — dzd?y; B = dzd?zx — de d?z; 
C = dx d?y — dy d2z. 
If (Z) is the trajectory of a moving point, the velocity vector will be defined 
by 
dr ds 


v=, = t. 


dt dt 
Further differentiation with respect to time gives us the acceleration vector 


dv = d2s i ds dt 
d Tae toa ae 


WwW = 
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or by (15) and (16): 


d?s ds ds dt d?s v? ds 
daa tta a d a g” (»= | 


w= 


whence it is clear that the acceleration vector has a component along the 
tangent equal to d?s/dt? and one along the normal equal to v?/ọ, whilst the 
component along the binormal is zero. 


125. Frenet’s formulae. We introduce a notation for the direction- 
cosines of the axes of a movable triad with respect to fixed coordinate 
axes, as indicated in the accompanying 


table. 
x Y Z Frenet’s formulae give the derivatives 
-/- of the nine direction-cosines written with 
t a B y respect to s. 
The components of unit vector t are 
m a, B, y a, B, y, and the expression 
Noi 
b ag Bs Yz 
leads to the first three of Frenet’s 
formulae: 
da _ a, dp oh, dy _ 7 
ds ọọ ? ds e? ds ` g’ (28) 


Similarly, (19) leads to the three further formulae: 


da, a , afs _ B . dy, _ vi 
ds «’ dst? dst" (28,) 
Consideration of the movable triad gives us directly n = — txb, 
and we obtain by differentiation with respect to s: 
ds e T e T 


This gives the final three formulae: 


da o eoan dh _ Bh, 
ds e tT? d e T? 
dy Ye 
ds e T’ (28,) 


We can easily show by using (28) that if the curvature 1/0 ts zero 
along a curve (L), (L) is a straight line. The identity 1/9 = 0 gives, 


in fact, 
da dp _ ay 


ds ds ds 


= 0, 
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whence it is clear that a, £, y are constants. But we know from [I, 160] 
that the direction-cosines a, 8, y of the tangent are equal respectively 
to dz/ds, dy/ds, dz/ds, and since these are constant, the coordinates a, 
y, z must be first degree polynomials in s, i.e. we have in fact a straight 
line. 

It may easily be shown in a similar way that if the torsion is zero 
along a curve, the curve lies in a plane. 


126. The osculating plane. The plane defined by vectors t and n 
is called the osculating plane to the curve. The normal to this plane is 
given by b, the direction-cosines of which we proceed to find. 

Since b is a unit vector, its direction-cosines are equal to its compo- 
nents bx, by, bz. It follows from (17) that: 


a, = b, = ty 2, — t y3, 
bz = by = t, ny — by Mes (29) 
Ya = bz = ty ny — ty Ny 


where ty, ..., Ny, ... are the components of t and n. As we saw above, 
tx, ty, t, are proportional to dx, dy, dz; nx, ny, Nz are proportional to the 
components of N, which are equal to d?æ/ds?, d?y/ds?, d?z/ds?, these 
latter being in turn, by (24), proportional to the differences 


d?gz ds — d?s dz, d2yds—d?sdy, d?zds— d?s dz. (30) 


We can thus replace tx, ty, t, by dæ, dy, dz in (29), ny, ny, Nng being re- 
placed by differences (30). On cancelling, we find that the direction- 
cosines of the binormal are in fact proportional to the expressions: 


A = dy d?z — dz d?y; oe (31) 


C = dx d?y — dy dx, 


which we introduced above [124]. If we take (x, y, z) as the coordi- 
nates of a variable point M of the curve (L), we can write the equation 
of the osculating plane as 


A(X —a) + BY —y) + CZ — 2) =0. 


At points where the length | N| = 0, i.e. ọ = œœ, all three of ex- 
pressions (31) are zero, as follows from (27), and the osculating plane 
is not defined. The directions of the principal normal and binormal 
are also not defined. 
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127. The helix. Let (l) be the base in the XY plane of a cylinder with 
generators parallel to the z axis (Fig. 109). Let o be the length of arc of (l) 
measured in a definite direction from its point of intersection A with OX, and 
let the equation of (l) be 


x= (co); y= (a). (32) 


We mark off an arc AN on (l) and draw a straight line NM = ko, parallel 
to the z axis, where & is a given numerical coefficient (the thread of the screw). 


Fie. 109 


The locus of the point M is a helix (L), inscribed on the cylinder. Its parametric 
equations are clearly: 


x= (sc); y= (os); z= ko. (33) 
Let s be the length of arc of (L) measured from the point A. We have: 
ds? = da? + dy? + de? = [p" (0) + y (0) + h*] do®. 


But 9’(c) and y’(a) are respectively equal to the cosine and sine of the angle 
formed by the tangent to (l) with the x axis [I, 70], whence g’*(a) + p”?(o) = 1, 
and we can write the above expression as 


ds = Y1 + k? do, 
so that 
s=VI1+#o, 


We now find the cosine of the angle formed by the tangent to (L) with the 
z axis: 
dz dz de k ; 
yds do ds yite’ 


this gives the first property of a helix: the tangent to a helix forms a constant angle 
with a fixed direction. 
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We recall the third of expressions (28), which gives in this case: 


o= L 


- or y =0, 


so that the principal normal to a helix is perpendicular to the z axis, i.e. to 
generators of the cylinder. On the other hand, it is perpendicular to the tangent 
to the helix. A generator and a tangent are easily seen to define the tangent 
plane to the cylinder at the point of the helix concerned, and it follows from 
the above that the principal normal is perpendicular to this tangent plane. 
Hence we have a second property of the helix: the principal normal at any point of 
a helix coincides with the normal to the cylinder on which it is traced. 

We now return to the cosines y, y,, Yz of the angles formed by the z axis 
with the directions of a movable triad of the helix. On noting that y? + y? + 
+ x2 = 1 and that y and y, are constants, as we saw above, we can conclude 
that y, is also constant. The third of expressions (28,) gives in the present case 
— (y/ 0) — (y,/+) = 0, whence we see that the ratio 9/7 is constant; this gives us 
the third property of a helix: the ratio of the radius of curvature to the radius of 
torsion is constant along a helix. Let r denote the radius of curvature of the 
plane curve (l). In view of the square of the curvature being equal to the sum 
of the squares of the second derivatives of the coordinates with respect to the 
length of arc, we can write: 


Se = 98 (0) +y”? (0) 


and l 
( oe i + ( oe i T ( z ) = (Gy ua 
+(e) + Ger) ape 
whence 
Met eG) v’2(a) 1 


E T TFE E FEE T E 


or ọ = (1 + k?) r, i.e. the radius of curvature of a helix differs only by a con- 
stant factor from the radius of curvature of the guide-curve at the correspond- 
ing point. If the cylinder is circular, i.e. the guide-curve (l) is a circle, r is con- 
stant, so that ọ is also constant; further, by the third property above, t is 
now likewise constant, i.e. a helix on a circular cylinder has constant curvature 
and constant torsion. 

A further important property of a helix may be described in conclusion. 
If two points are taken on a cylinder, the shortest distance between them is 
given by the helix passing through the points. A helix on a cylinder is exactly 
analogous in this connection to a straight line on a plane. The present property 
is usually stated as: the geodesics of a cylinder are helices. The geodesics of a surface 
are defined in general as the curves on the surface giving the shortest distance 
between two points. 

If we roll out the cylinder on the XZ plane by rotating it about a generator 
passing through the point A, the helix becomes a straight line on the plane since 
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the ratio of arc AN to the straight segment NM retains the constant value 
1/k. Lengths will be preserved in the unrolling, and the above property — that 
the helix gives the shortest distance between two points of the cylinder — 
becomes obvious. This property is in fact directly related to the second property 
of a helix, i.e. to the fact that the principal normals of a helix coincide with 
the normals to the cylinder. It may be shown generally in geometry that the 
principal normals to the geodesics of any surface are identical with the normals 
to the surface. 


128. Field of unit vectors. Let t be a field of unit vectors, i. e. a unit vector t 
is given at every point of space. We deduce a simple and important expression 
for the curvature vector N of vector lines of the field. If coordinates (x, y, z) 
are taken, and s is the length of are of a vector line, we can write: 


de  ,. dy _,, dd 


v= diy Oly ~~ dw, Oy dy + dt _ dz 
Di ds Ox ds © dy ds Oz ds 
or 
at at at 
Te oe x x x 
Ne=- RR ay T e 


On differentiating the equation 
+t t= 
with respect to x, we get: 


ot. Oty Ot, 
«Gg ty Gg Fe ae = 


By subtracting this sum from the above expression for N,, we can write: 


at». at, ) ( Oty Oty } 
Nx ( d On J? an Oy J” 
ie. N, = (curl tx t),, whilst similar expressions can clearly be written for the 


other two components. The required expression for the curvature vector of 
a vector line is thus: 


N=curltxt. (34) 


The necessary and sufficient condition for these lines to be straight is that 
the length of N, i.e. the curvature 1/ọ, is zero [125]. Hence the necessary and 
sufficient condition for the vector lines of a unit field t to be straight is that: 


curltxt = 0. (35) 


Furthermore, we saw in [110] that the necessary and sufficient condition 
for the existence of a family of surfaces orthogonal to the vector lines is: 


curlt« t= 0. (36) 
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Conditions (35) and (36) can only be satisfied simultaneously when curl 
t = 0, since non-vanishing of this vector implies its being parallel to t by (35), 
whilst (36) means that the two vectors are perpendicular. Hence, the vector lines 
of a unit field t are normal to a family of surfaces only when curl t = 0. This 
proposition plays an important part in explaining the principles of geometrical 
optics. 


§ 13. Elementary theory of surfaces 


129. The parametric equations of a surface. We have so far con- 
sidered the equation of a surface in space with (x, y, 2) axes in the ex- 
plicit form z = f(x, y) or implicitly as 


The equation of a surface may be written parametrically, the coor- 
dinates of its points being expressed as functions of two variable para- 
meters u and v: 


= g(u, v); Y= ypu, v); z= w(u, v). (38) 


We shall assume that these functions are single-valued and conti- 
nuous, and possess continuous derivatives up to the second order in a 
certain domain of variation of parameters (u, v). 

If we substitute the expressions for the coordinates in terms of u 
and v in the left-hand side of (37), we must get an identity in u, v. 
We have on differentiating this identity with respect to the independ- 
ent variables u, v: 


If we consider these as two simultaneous equations in 3F/əðx, 
aF/oy, 0F/dz and use the algebraic lemma given in [104], we get: 


OF i (32 _ p Oy : Oy 
=~ Ou Ov Ou or ). 


where k is a coefficient of proportionality. 
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We assume that k and at least one of the differences in brackets on 
the right-hand sides of the above expressions differ from zero. 
We denote the three differences as follows, for brevity: 


Op do do Oy d(iy,z) . 


ðu Ov Ou Ov d(u,v) ’ 


Ow i Op Oy i Oo _ d(z,æ) , 
du Ov ðu av — du, v) ” 


Op Oy op , 3p d(x, y) 
Ju av ou a — d(u,v) 


We know from [I, 160] that the tangent plane at a point (a, y, 2) 
of our surface can be written as: 


or or or 
Ox (X x) ! i. ~- (Y y) i az (Z 2) E 0, 


or, if 0F/da, OF /oy, 0F/dz are replaced by their proportionate values, 
the tangent plane may be written: 


_a(y,z)_ d(z, x) d(x, y) Ei 
Tay (22) + aay V en A IS (89) 


The coefficients in this equation are known to be proportional to the 
direction-cosines of the normal to the surface. 

The position of a variable point M of the surface is characterized 
by the values of the parameters u, v, and these are usually referred 
to as the coordinates of the point on the surface. 

On assigning constant values to u and v, we obtain two families of 
lines on the surface: the so-called coordinate lines u = C,, along which 
only v varies, and the coordinate lines v = C,, along which only u 
varies. The two families give rise to a coordinate mesh on the surface. 

Let us take the example of a sphere of radius R with centre at the 
origin. The parametric equations of the sphere may be written in the 
form: 


z= Rsinucosv; y= Rsinusinv; z= Rcosu. 


Coordinate lines u = C, and v = C, evidently consist here of parallels 
and meridians of the sphere. 

We can characterize a surface without reference to coordinate axes 
by the radius vector r(u, v) drawn from a fixed point O to a variable 
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point M of the surface. The partial derivatives of this radius vector 
with respect to the parameters, r; and rý, are evidently vectors directed 
along the tangents to the coordinate lines. The components of these, 
vectors on the axes OX, OY, OZ are, by (38), pa, Pu, œu and Po, Yh, ws 
and it is clear from this that the coefficients in the equation of the 
tangent plane (39) are the components of the vector product rj, x rj. 
This latter is perpendicular to the tangents r, and r, and is therefore 
directed along the normal to the surface; the square of its length is 
clearly given by the scalar product of r X rp with itself, or to put 
the matter more simply, by the square of this vector product.t The 
unit vector along the normal to the surface plays an important part 
later, and we can evidently write this as: 


tu X ro 
O Veu XG)? i 

If we change the order of the factors in the vector product, we get 
the opposite direction for (40). We shall later fix a definite order 
for the factors, i.e. we shall define precisely the direction of the 
normal to the surface. 

Let M be any point of the surface and let (L) be any curve lying on 
the surface and passing through the point M. The curve will not in 
general be a coordinate line, and both u and v will vary along it. The 
direction of the tangent to the curve will be given by the vector 
ry -+ ry dv/du, if we assume that the parameter v is a function of u 
possessing a derivative in the neighbourhood of M along (L). It is 
clear from this that the direction of the tangent at any point M of a curve 
lying on a surface is fully defined by the quantity dv/du at this point. 
We assumed when defining the tangent plane and deducing its equa- 
tion (39) that functions (38) have continuous partial derivatives at 
and in the neighbourhood of the point in question and that at least 
one of the coefficients of equation (39) differs from zero at the point. 

If d(x, y)/d(u, v) 4 0 for u = uo, v = vy, the same will be true in 
some neighbourhood of these values. This neighbourhood transforms, 
by the first two of expressions (38), into a neighbourhood of the values 
Lo = P(Uo, Vo), Yo = P(o Vo), and the first two of equations (38) 
can be solved with respect to u and v [I, 157] for (x, y) sufficiently 
close to (£o Yo), ie. u and v can be expressed in terms of x and y. 


t In general, we shall denote the square of the length of a vector A as A?, 
ie. the scalar product A ° A. 
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Substitution of these expressions in the third of equations (38) gives 
the explicit equation of the surface z = f(x, y) in the neighbourhood 
of the point concerned. 


130. Gauss first differential form. We now consider the square of 
the differential of arc of any curve on a surface: 


ds? = dz? + dy? + dz? = 
_ ( & Oa 2 oy ay 2 əz dz 2 


On removing the brackets, we get what is known as the Gauss first 
differential form: 


ds? = E(u, v) du? + 2F (u, v) du dv + G(u, v) dv?, (41) 


where 
rn (SY TE] 
Pls ae ae oe oe ae (42) 
con (SP (BT HEY 
or 
Bar}; Porm; G= re (42,) 


We can show, exactly as in [119], that the vanishing of the coeffi- 
cient F is the necessary and sufficient condition for the coordinate 
lines u = C, and v =C, to be mutually perpendicular. The curvi- 
, linear coordinates u, v on the 
surface are called orthogonal co- 
ordinates in this particular case. 

We now find an expression for 
an elementary area of the surface 
in terms of the coefficients of (41). 
We take the small area bounded 

Fic. 110 by two pairs of neighbouring 

coordinate lines (Fig. 110). Let 

(u, v) be the coordinates of vertex A. The sides AD and AB are 

respectively rj du and ry dv. If we look on the small area as a paralle- 

logram [cf. 57], its area can be written as the length of the vector 
product of these two vectors, i.e. 


dS = |r, du xr, do| = |r, X r| du dv. 
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We have for the square of the length of the vector product: 
oy 0z dz oy )' + ( dz Ox Ox Oz y eu 


Qu æ ou w 


Oa oy Oy Ox \2 
+(e æ ðu ` DE 


whence, by identity (26) of [124]: 
(rXm) = EG — F?, (43) 
so that we finally have for an elementary surface area: 


d8 = VEQ — F? du dv. (44) 


Furthermore, by substituting (43) in (40), we can write the ex- 
pression for the unit normal to the surface as 


es 
ru X To 


m= GF 
It may be pointed out that EG — F? is positive, by (43). 


(45) 


131. Gauss second differential form. We consider a curve (L) on a 
surface and let t be its unit tangential vector. This is obviously per- 
pendicular to the unit normal vector to the surface, i.e. t -m = 0. 
We have on differentiating this relationship with respect to the length 
of arc s of (L): 


where ọ is the radius of curvature and n the unit principal normal to 
L). The above equation can be rewritten as 


n-m dr dm cosp ss dr-dm 


eo o de ` ds O e ds?’ 


where ¢ is the angle between the normal to the surface and the princi- 
pal normal to (L). We can express the differentials dr and dm in terms 
of the coordinate parameters u and v and thus write: 


cosy _ — (ry du + rdv) - (mj du + my dv) 


; i (46) 


On removing the brackets in the numerator, we get the Gauss second 
differential form: 


— (r, du + r, dv) - (m; du + m; dv) = 
= L(u, v) du? + 2M (u, v) du dv + N(u, v) de®, 
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where 
A r 1 , , 1 , ? 
L= — r; mi; M = — > (tu: my) — -y (to m,); (47) 
N= — r, m, 


so that (46) takes the final form: 


cosp _ Ldu? +2M dudv + N de? 48 
e  Edw+2Fdudv+Gd ' (48) 


We now notice alternative expressions for the coefficients L, M, 
and N. Differentiation of the obvious relationships 


+ , 
r,:m = 0, r'm = 0 


with respect to the independent variables u, v gives us the four 
equations: 
rpm -+r m; = 0; rg. m -+ r.m = 0; 


” , , X , 
ry m -+ rp: m, = 0; rym -+ r, m, = 0, 


and from these, together with (47), we can write the following ex- 
pressions for the coefficients of the Gauss second differential form: 


L=rp:m; N=rhem; (49) 
M = r}, m = — ri m, = — ry mi. 


On recalling expression (45) for the vector m, we can write (49) 
in the form: 


L= rus. (ru Xr) i M= Truo’ (Fi X ro) é 
VEG@—F: ’ VEG —F? ’ 

N = (tu xr) 50 

VEG— Fi (50) 


We now take the case when the equation of the surface is given 
explicitly: 

z= f(x,y). (51) 

The role of parameters is now played by x and y, and we have the 


following expressions for the components of the radius vector and its 
derivatives with respect to the parameters: 


r(x, y,2); r,(1,0,p); ¥y(0, 1, 9) 
riya (0, 0, r); Try (0, 0, 8); Tye (0, 0, t), 
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where 


of. of of ar. OY 


P= ee I= Gy? 7 GE? 8 = aay? ay 


(52) 


Use of (42,) and (50) gives us the coefficients of both Gauss forms as: 
F=1+ 7p; F=pq; G=1+¢; 


Set that : Meio et Wis F « 
Vit pt’ FPFE’ ETET 

We now make a definite choice of axes by taking the origin at a 
point M, of the surface, OX and OY in the tangent plane, and OZ 
along the normal to the surface at M,. We use the zero subscript 
to indicate that a magnitude is being taken at M,. With the present 
choice of axes, the cosines of the angles formed by the normal to the 
surface with OX and OY at M, will be zero, so that we have [62] 
Po = Yo = 0, whilst (53) gives at M,: 


Ly=%; Mi= s; Ny = to. (54) 


(53) 


132. The curvature of lines ruled on surfaces. We return to (48). 
Its right-hand side depends on the values of the coefficients of the 
two Gauss forms and on the ratio dv/du. The last statement is im- 
mediately clear on dividing numerator and denominator by du?. The 
coefficients are functions of the parameters u, v and have a definite 
numerical value at a given point of the surface. As we saw in [129], the 
ratio dv/du characterizes the direction of the tangent to the curve con- 
cerned. We can therefore say that both sides of (48) must have a 
definite value if we fix the point on the surface and the direction 
of the tangent to the curve on the surface. If we take two curves 
through a fixed point on the surface with the same tangential 
direction and the same principal normal, the angle » will be the same 
for both curves, and therefore, by (48), @ will also be the same. We thus 
have the following theorem: 

First THEOREM. Two curves on a surface with the same tangent 
and principal normal at a given point have the same radius of curvature 
at this point. 

If an arbitrary line (L) is drawn on a surface and passes through 
a point M, the plane containing the tangent and principal normal 
to (L) at M will cut the surface in a plane curve (Z,) having the 
same tangent and principal normal as (L) and hence having the 
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same radius of curvature. The theorem thus proved enables the 
investigation of the curvature of any curve on a surface to be reduced 
to the study of the curvature of a plane section through the surface. 

A normal section of a surface at a given point M is defined as the 
section by any plane passing through the normal at M. There is 
obviously an infinite set of normal sections, one particular section 
being specified by assigning a definite tangential direction in the 
tangent plane to the surface, i.e.we fix the value of dv/du. The principal 
normal to a normal section must be equal or opposite to the vector m, 
so that the angle p equals 0 or x, whence cos p = +1. 

Let (L) be any curve on a surface through a point M. The normal 
section corresponding to (L) at M is defined as the normal section 
having a tangent in common with (L) at M. Let ọ be the radius of 
curvature of (L) and R the radius of curvature of the corresponding 
normal section. We confine our attention here to the point Jf. Since 
both curves have the same tangent, the right-hand side of (48) is 
the same in both cases, and we can write 

cos o Ja be 


Ro? ie. 0=+#-cosg, (55) 
where is the angle between the principal normal to the curve and 
the normal to the surface. The last formula expresses the following 
theorem: l 

Seconp THEOREM (Meusnier’s theorem). The radius of curvature at 
any given point of a curve on a surface is equal to the product of the 
radius of curvature of the corresponding normal section at the point 
with the cosine of the angle between the normal to the surface and the 
principal normal to the curve. An alternative statement of the theorem 
is: the radius of curvature of any curve on a surface is equal to the 
projection of the radius of curvature of the corresponding normal 
section (marked off along the normal to the surface) on the principal 
normal to the curve. 

In the case of a sphere, a normal section is a great circle, and if 
we take (Z) as any circle traced on the sphere, (55) reduces to the 
obvious relationship between the radii of the two circles (Fig. 111). 

By the second theorem, investigation of the curvature of a curve 
on a surface reduces to investigation of the curvature of the normal 
section at a given point of the surface. We have seen that, for a 
normal section, we must take cos p = +1 in (48). We shall agree 
to refer the (—) sign to @ when it occurs, i.e. we shall take the radius 
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of curvature of a normal section as negative if the principal normal 
to the section is in the opposite direction to m, i.e. opposite to the 
chosen normal direction to the surface. With this agreement, the 
formula is valid for normal sections: 


1 _ Ldu?+2M du do + N de? (56 
E Baw piPdudo+Gart ° ) 


It may again be recalled that the coefficients of the differential 
forms on the right of this expression have definite values, since we 
have fixed the point on the surface and 
the value of 1/2 depends only on the ratio 
dv/du, ie. on the choice of tangential di- 
rection. The denominator on the right of 
(56) always has a positive value, since it 
is the expression for ds’, and the sign of 
the curvature 1/R of a normal section is 
therefore defined by the sign of the numer- 
ator. We may distinguish the following 
three cases: 

1.If M?—LN <0 at the point taken, 
1/R has the same sign for all normal sec- 
tions, i.e. the principal normals to all normal sections are directed 
towards the same side. Such a point on a surface is described 
as elliptic. 

2. If M? — LN > 0, 1/R has different signs, i.e. there are normal 
sections at the point taken with opposite directions for the principal 
normal. A point of this kind is called hyperbolic. 

3. If M? — LN = 0, the numerator of the right-hand side of (56) 
is a perfect square and 1/R does not change sign here, although it 
vanishes for one particular normal section. This kind of point is 
called parabolic. 

It may be noted that the numerator of the right-hand side of (56) 
vanishes whilst changing sign in the hyperbolic case, and there are 
two normal sections with zero curvature. There are no such sections 
in the elliptic case. 

We take axes as in [131], with the origin at the point of the surface 
taken and OX, OY situated in the tangent plane. 

By (54), equation (56) takes the form: 


Fia. 111 


1 __ radz? + 28, dx dy + t, dy? 
R ds? 
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The tangent to the normal section lies in the XY plane, and the 
ratios da/ds, dy/ds are equal respectively to cos 0 and sin 0, where 6 
is the angle formed by the tangent with the z axis. The above expression 
thus becomes: 


= = Ta cos? 6 + 28, cos 8 sin 0 + ty sin? 0. (57) 

This is an explicit expression of the dependence of the curvature 
1/R on the direction of the tangent, characterised by the angle 6. 
Now, the point will be elliptic of så — rẹ tọ < 0, hyperbolic if sê — 
— Tofo > 0, and parabolic if så — rot, = 0. 

In the case s — Toto < 0, the function z = f(x, y) will have a 
zero maximum or minimum at the point concerned [I, 163], i.e. the 
surface near the point is situated on one side of the tangent plane. 
With sa — rot, 0,> there is neither a maximum nor minimum, i.e. 
the surface is situated on both sides of the tangent plane in any 
neighbourhood of the point. Finally, at a parabolic point where 
85 — To to = 0, nothing definite can be said of the disposition of the 
surface relative to the tangent plane. 

It follows directly from (53) that the sign of (M? — LN) is the 
same as that of (s? — rt) for any choice of XYZ axes, so that the 
point is elliptic for s? — rt < 0, hyperbolic for $ — rt > 0, and 
parabolic for s? — rt = 0. 

The same surface may have different kinds of point. For instance, 
in the case of a torus, obtained by rotation of a circle about an axis lying 
outside it but in the same plane [I, 107], points lying on the outward 
side are elliptic whilst points on the inward side are hyperbolic. 
These domains are separated from each other by the extreme parallels 
of the torus, all the points of which are parabolic. 


133. Dupin’s indicatrix and Euler’s formula. Having fixed the 
coordinate axes as in the previous article, we draw an auxiliary 
curve in the tangent plane, i.e. the XY plane, as follows: we mark 
on each radius vector from the origin O a length ON = +R, 
where R is the radius of curvature of the normal section for which 
the radius vector taken is a tangent. The (+) sign is taken so that 
the quantity under the radical is positive. The locus of the point N 
is a curve called the Dupin indicatrix. The curve has the following 
property by construction: the square of the radius vector to any 
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point of it gives the absolute value of the radius of curvature of the 
normal section of which the radius vector is a tangent (Fig. 112). 

The equation of the indicatrix is obtained as follows: let (&, 7) be 
the coordinates of any point N of 
it. We have by construction: 


¿= |+ Rcosð; 
u= V+ Bsin9, 


i.e. 
& = + Rcos? 0; 
K = + Resin? 6, 


Fie. 112 


where the upper sign must be ta- 
ken for positive R, and the lower 
sign for negative R. Multiplication of both sides of (57) by +R 
obviously gives us: 


To È + 28, &y +t) 7? = +1. (58) 


This is the equation of the indicatrix. The curve gives a geometrical 
picture of the change in radius of curvature as a normal section 
rotates about a normal to the surface. In the case of an elliptic point, 
(58) gives an ellipse and a definite sign has to be taken on the right. 
We get two conjugate hyperbolas from (58) in the case of a hyperbolic 
point; with a parabolic point, the left-hand side is a perfect square 
and (58) may be re-written as: 


k(ag + bn)? = +1, ie. (E+ bq = +7 =P 


or 


giving a set of two parallel straight lines. The curve has its centre 
at O in all three cases and has two axes of symmetry. We can take 
these as the x and y axes, in which case we know that the term in 
&n falls out on the left-hand side of (58), i.e. we must have s, = 0, 
so that (57) now becomes: 


l 


F =r cos? 0 + t sin? 6. (59) 
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The geometrical significance of coefficients ry and tọ may be seen 
as follows. On setting 0 = 0 in (59), we get the curvature 1/R, of the 
normal section to which the x axis is the tangent, and we have r, = 
= 1/R,. Similarly, on setting 0 = a2, we get ¢)=1/R,, where 1/R, 
is the curvature of the normal section to which OY is tangential. 
We get Euler’s formula on substituting the values found for rọ and 
ty in (59): 

1 cos? 6 sin? 6 
E =a R, + P, . (60) 

It will be recalled that our x and y axes are the axes of symmetry 
of curve (58). We assume that 1/R, Æ 1/R, and that, for instance, 
1/R, > 1/R,. It follows at once from (60) that 1/R attains its greatest 
value at 6 = 0 and 0 = 2, and its least value at 0 = 7/2 and 0=382/2. 

The result obtained may be stated as the following theorem: 

Tuirp THEOREM. There exist at any point of a surface two mutually 
perpendicular directions lying in the tangent plane for which the curvature 
1/R has a maximum and minimum; if the curvatures corresponding 
to these directions are 1/R, and 1/R,, the curvature of any normal section 
is given by (60), where 0 is the angle that the tangent to the normal section 
forms with the direction that gives 1]R,. 

We refer to R, and R, as the principal radii of curvature of normal 
sections at the point concerned. The two directions in the tangent 
plane that give rise to these are called the principal directions. 
Moreover, in the hyperbolic case it is useful to distinguish two further 
directions in the tangent plane, those of the asymptotes to the indi- 
catrix. The radius vectors to the indicatrix in these asymptotic 
directions are infinite, and the curvatures of the corresponding normal 
sections are zero at the point taken. 

In the elliptic case, R, and R, have the same sign, whilst their 
signs are opposite in the hyperbolic case. The curvature of one of 
the principal normal sections becomes zero in the parabolic case; 
if we take say 1/R, = 0, we get the formula for the parabolic case: 


_1 __ _cos?6 
gH, 

A particular case of elliptic points on a surface may be noticed, 
when R, and R, are equal. We have in this case from (60): 1/R = 1/R,, 
i.e. all normal sections have the same curvature at the point taken. 
A point of this sort on a surface is called an umbilic. A surface becomes 
very like a sphere in the neighbourhood of an umbilic. It can be shown 
that a sphere is the only surface, all the points of which are umbilics. 
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134, Finding the principal radii of curvature and principal directions. 
The basic expression (56) for the curvature of a normal section may 
be written in the form 


(L — ER—) du? + 2(M — FR-) dudv +(N — GR-) dv? = 0. (61) 


On dividing by dv? and bringing in the auxiliary t = du/dv, 
characterizing the tangential direction to the normal section, we get 
the equation: 


y(k-1, t) = (L— ER) ? + 2( M — FR) t + (N—GE) =0, 


which gives the curvature #-1 of the normal section as a function of t. 
The value of 2-1 must be a maximum or minimum for the principal 
directions, so that the derivative of R-! with respect to £ must vanish. 
But this derivative is obviously given by [I, 69]: 


dp 
dR “dt 
dt dp ’ 
dR“ 


so that the derivative dø/dt must vanish for the principal directions, i.e. 


-y = (L — ER~) t + (M — FR) = 0. 


On replacing t = du/dv and multiplying by dv, we get: 
(L— ER-) du + (M — FR) dv = 0. (62) 


If we were to divide (61) by du? and take t, = dv/du as the variable 
characterizing the direction of the tangent, we should arrive in the 
same way at the equation for the principal directions: 


(M — FR-1) du + (N — GR-}) dv = 0. (63) 


On taking the term in dv to the right-hand side in (62) and (63) 
and dividing the respective sides of the equations into each other, 
we obtain a quadratic equation for the curvatures 1/R, and 1/R, of 
the principal normal sections: 


(EG — F?) + (2FM — EN — GL), + (LN — M?) =0. (64) 


The quantity: 


K=FF, (65) 
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is called the Gaussian curvature of the surface at the given point, 
whilst the quantity 


1 1 1 
“+ G44) oo 
is known as the mean curvature. We get directly from the quadratic 
equation (64) expressions for the Gaussian and mean curvatures in 
terms of the coefficients of the first and second Gauss forms: 


LN— M? |. Haz EN —2FM + GL 


= “EGF? aea ry (67) 


We now write equations (62) and (63) as 
(Ldu + M dw)R = Edu + F dv; 
(M du + N dv) R = F du + G dv. 


We eliminate & by dividing these equations into each other, and 
obtain after simple rearrangement: 


(EM — FL) du? + (EN — GL) du dv +- (FN —GQM)dv?=0. (68) 


Division by du? gives us a quadratic equation in dv/du, the roots of 
which characterize the principal directions at a point of the surface: 


d d 
= 91 luo); = plu, 0). (69) 


135. Line of curvature. A line of curvature is defined as a line on 
a surface such that the tangent at every point is along a principal 
direction. Since there are two principal directions at every point of 
a surface, there will be two families of lines of curvature on the 
surface and the families will be mutually orthogonal. The aggregate 
of all lines of curvature thus gives rise to an orthogonal mesh on the 
surface. Equation (68) or the equivalent equations (69) represent 
differential equations for the lines of curvature; integration gives us 
v in terms of u, and substitution of the expression obtained in the 
equation of the surface leads to the actual equations of the lines of 
curvature. 

Let us consider the conditions under which a given coordinate 
mesh on a surface represents a mesh of lines of curvature. First of 
all, the mesh must be orthogonal if it consists of lines of curvature, i.e. 
we must have F = 0. Furthermore, if the coordinate lines u = C, 
and v =C, are to be lines of curvature, equation (68) must be 
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satisfied on substituting constants for u and v. If we take into account 
the result already obtained, that F = 0, we have GM =0 and 
EM =0. But we have seen that EG — F? is positive, so that E 
and G cannot be zero; it follows from the above that we must have 
M = 0. Hence a necessary condition for the coordinate mesh to be 
a mesh of lines of curvature is that F = M = 0. Conversely, if this 
condition is satisfied, the differential equation (68) of the lines of 
curvature has the solution u = C, and v = C, i.e. the coordinate 
lines are lines of curvature; hence 
we have the following theorem: a 
necessary and sufficient condition for 
a coordinate mesh to be a mesh of 
lines of curvature consists in the va- 
nishing of the middle term in both 
the Gauss differential forms every- 
where on the surface, i.e. F = M=0. 

It is possible to give a different 
definition of line of curvature to 
that at the beginning of the present 
article. Let (L) be a curve on a sur- 
face. The normals to the surface 
along (L) form a family of straight 
lines with a single parameter defin- 
ing the position of the point on (Z), andthe family will not in gen- 
eral have an envelope. An envelope will exist, however, if the 
curve (L) is chosen in a particular way.t The conditions for a suit- 
able choice will now be explained. 

Let the curve (Z) be chosen on the surface so that there exists 
an envelope (£,) of normals to the surface along (L) (Fig. 113). Let 
r denote the radius vector to a point of (L), r, the corresponding 
radius vector to a point of (L,), and a the algebraic length of the normal 
measured from (L) to (L); then we can clearly write: 


r, =r-+am, (70) 


where m is as usual the unit normal to the surface. Since (L) is the 
envelope of the normals, the vector dr,, directed along its tangent, 


t A family of straight lines in space, containing a single parameter, in general 
has no envelope, i.e. the lines are not tangents to any one curve. There is an 
envelope only in particular cases. 
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must be parallel to m, and we can write dr, = bm, where b is a scalar. 
We get on differentiating (70): 


bm=dr-+adm+dam, ie. dr+adm=cm, (71) 


where c is a scalar. We show that c = 0. We form the scalar product 
of both sides of (71) with m: 


dr-m + adm-m =c. 


Vector dr is directed along the tangent to (L), i.e. is perpendicular 
to m, so that dr- m = 0. Moreover, it follows as usual from the 
equation m : m = l1 that dm-m=0, and the above equation 
therefore in fact gives c = 0, whilst (71) becomes: 


dr + adm = 0. (72) 


This expression is generally known as Rodrigues’ formula. It has 
been obtained from the assumption that normals to the surface 
along (L) have an envelope. We now assume the converse, that 
expression (72) is valid along a curve (L) on the surface. Formula 
(70) now defines a curve (L,); on differentiating the formula and 
taking into acceunt (72), we get dr, = dam, i.e. the direction of m 
and the tangent to (Z,) are parallel. In other words, normals to the 
surface along (L) are tangents to (Z4). Hence (72) gives the necessary 
and sufficient condition for the existence of an envelope of normals 
to the surface along (L). It must be noted that the envelope can 
degenerate to a point; the normals in this case form a cylindrical or 
conical surface, where it may be shown that condition (72) must 
still be fulfilled. 

We write (72) in the expanded form: 


r, du + r, dv + a(m, du + m, dv) = 0 


and form the scalar product with rj. 
We get by (42,), (47) and (49): 


E du + F dv + a(— Ldu — M dv) =0, 


which is the same as equation (62) with a = R. Similarly, on forming 
the scalar product with r,, we obtain equation (63). It is easy to 
show the converse, that equation (72) is obtained with a = R from 
(62) and (63), which define the principal radii of curvature and principal 
directions. We shall not dwell on the proof. Condition (72) for the 
existence of a normal envelope is thus equivalent to (62) and (63), 


136] DUPIN’S THEOREM 383 


a being one of the principal radii of curvature. These remarks lead 
us to the following result: lines of curvature on a surface are charac- 
terized by the property that the normals to the surface along them 
have envelopes (or form cones or cylinders), the length of the normal 
between the surface and envelope being equal to one of the principal 
radii of curvature. 

If a plane curve rotates about an axis in its plane, the lines of 
curvature of the resulting surface of revolution are its meridians and 
parallels. The normals to the surface in fact form a plane along a 
meridian, and a cone along a parallel. 


136. Dupin’s theorem. Let three families of mutually orthogonal 
surfaces in space be: 


Plx, Y, 2) =Q; PL, Y, 2) =Q; O(2,Y,2) = Qz- 


They form a mesh of orthogonal curvilinear coordinates in space 
[119]. The radius vector r from the origin to a variable point M in 
space is characterized by the curvilinear coordinates q,, q>, qa of the 
point. The partial derivatives rj,, r4, r4 give vectors directed along 
the tangents to coordinate lines, and since the coordinates are orthog- 
onal we can write the vector equations: 


"r, =0. (73) 


, l EDERT a P E a A 
Kaa Ega =O; Fa Ig =O; Talos 


q2 ° "Qs a43 qı 


We differentiate the first of these equations with respect to qu 
the second with respect to g,, and the third with respect to q,: 


n t , n ee 
Taiga * Fas + Taa Fangs = 0 
n , , ” UE 
Taoqs * Yq H Tas Taig, T 0 
n , , ” a 
Tags ` "ga + Pa gege T 0. 
From these we obtain at once: 
” , n aE E E 
Torqa* Tan = Taag * Tox = aag Ton = O- 
We take together the three equations: 
, ee are een ae 
Toy Top = Tga * Ega = Taiga “Tos = O- 


It follows from these that the vectors rg, ra» and rg, are all 
perpendicular to the same vector rj, and are therefore coplanar, 
whence it follows that [105] 


Figa’ (Eg X Tga) = 0. (74) 
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We now take the coordinate surface q = C. Parameters q, and q3 
are coordinate parameters on it, and the coordinate lines q, = C and 
qa =C are the lines of its intersection with two other coordinate 
surfaces of our orthogonal coordinate set in space. We had the follow- 
ing expressions in [130, 131]: 


a , r 
F=r' -r — Fag’ (rg, X¥q,) 
= Ty, = 1h Vas a 


a Ma“ 


and equations (73) and (74) show that here F = M = 0, i.e. the 
qı and q, coordinate lines are lines of curvature on the surface q} = 
= const. This leads us to Dupin’s theorem: given three families of 
mutually orthogonal surfaces in space, any two surfaces of different 
families intersect in a line which is a line of curvature for both the 
surfaces. 


137. Examples. 1. The equation of the oblate ellipsoid of revolution: 
x? y2 g2 
e teat. eee 
can be written in the parametric form: 


x =acosusinv; y=asinusinv; Z= C COSV. 


The coordinate lines u = c, are clearly the lines of intersection of the ellipsoid 
with the planes y = x tanc,, passing through the axis of rotation, i.e. they 
are meridians, whilst the coordinate lines v = c, are parallels, obtained by 
the intersection of the ellipsoid with the planes z = c cos c,, perpendicular 
to the axis of rotation. On applying formulae (42) and (50) of [130, 131] and 
taking into account the fact that x, y, z are the components of r, we get: 


HE=da'sin?v; F=0; G=a?cos*v + c*sin v; 


ac sin? v ac 


L ==;  M=0; N = 
Va? cos? v +- c? sin? v 


yV a? cos? v + c? sin? v 

The equation F = M = 0 may be foreseen from the fact that the meridians 
and parallels are lines of curvature of the ellipsoid of revolution. The remaining 
coefficients depend only on the parameter v, characterizing the position of a 
point on a meridian. The principal directions are clearly given by the tangents 
to meridians and parallels. The expression (LN — M?) is here positive over the 
entire surface, i.e. every point of the surface is elliptic. We note the Gaussian 
curvature, without working out the principal radii of curvature separately: 


1 LN—M? _ ot 
RR, E@—F? ~~ (atcose + csin? v)? 


K = 
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2. The equation of a cone of the second order: 


2 2 2 
Z Y cA 


“a T e 
may be written explicitly as 
ax? y? 
PSS |= aie 


We easily find by direct differentiation that: 


en y. cyt. 
P= -iiz ’? I= Gre? ~ a2 223’ 
cA ct at 
pe SY = 


abies? i= a? b? z3 * 


On using expressions (53), we can find all the coefficients of the Gauss forms. 
We remark simply that here ri — s? = 0, i.e. every point of the surface is 
parabolic, and one of the principal radii of curvature is infinite. The correspond- 
ing principal direction is evidently along the straight generator of the cone. 

3. We consider the hyperbolic paraboloid 


x2 y? 
z = oar ~ DoF 
Here, r = 1/a?, 8 = 0 and t = —1/b?, so that rt — 8 < 0, and every point 


of the surface is therefore hyperbolic. The two straight generators of the surface 
in this case give the asymptotes of the indicatrix, which consists of two conjugate 
hyperbolas. The situation is similar in the case of a hyperboloid of one sheet. 
4. Ordinary Cartesian, along with spherical and cylindrical, coordinates 
provide the simplest examples of orthogonal coordinates in space. A further 
example of such coordinates may be indicated. We take an equation of a second 
order surface containing a parameter @: 
a? y? T 
ae ttre tape = (e 
where a? > b? > c*, On fixing a point M(x, y, z) and getting rid of the denomina- 
tors, we arrive at a third degree equation in ọ. It can be shown that this equation 
has three real roots u, v, w, contained respectively between the limits 


+o>u> -e —e&>v>—bt; —b >w> —at. (76) 


In fact, the left-hand side of equation (75) approaches (— 1) for large positive 
values of ọ, and has the (—) sign, whilst for @ somewhat greater than (—c?), 
the term z?/(c? +- 9) has a large positive value and the left-hand side of (75) 
has the (+) sign. There must therefore be a value of ọ in the interval (—c*, œ) 
for which the left-hand side of (75) vanishes. Similar reasoning shows the 
existence of roots in the intervals (—b?, —c?) and (—a?, —b?). The three numbers 
u, v, w are called the elliptic coordinates of the given point M(x, y, z). Our discus- 
sion has assumed the non-vanishing of all three coordinates of the point (x, y, z). 
If this is not the case, an equation of lower degree than the third is obtained 
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for 9. If, say, z = 0, whilst x and y differ from zero, equation (75) will give u 
and v, whilst w has to be taken equal to (—c®). 

We now investigate the coordinate surfaces in the elliptic system. On substitut- 
ing 9 = u in equation (75), where u belongs to the interval (—c?, co), we get 
the surface: 

2 
a z 
Fu +F u wey + epu 
which is evidently an ellipsoid, since all three denominators in (77) are positive 
by the first of inequalities (76). On substituting ọ = v, where v belongs to 
(—b?, —¢), we get the hyperboloid of one sheet: 


x2 y? 22 
ago tepo t epo T. (78) 


since here a? + v > b? +v > 0 and œ +v <0. Finally, substitution of ọ = 
= w, where w belongs to (—a?, —b?), gives the hyperboloid of two sheets: 


-4 + = = 1. (79) 


We prove that the three coordinate surfaces are mutually orthogonal 
Subtraction of equations (77) and (78) gives: 
g? y? 22 = 
FEF 7 ewer tT ee =O © 
The direction-cosines of the normals to surfaces (77) and (78) are respectively 
proportional to 


a pee a eee ee, RN 
Fu’ Btu’? +u’ apv’? bpv’? ety’ 

and equation (80) expresses the condition for these normals to be perpendicular, 
i.e. proves that surfaces (77) and (78) are orthogonal. The other two coordinate 
surfaces may similarly be shown to be orthogonal. Dupin’s theorem enables 
us to state that the two families of lines of curvature are obtained on the ellipsoid 
(77) (with fixed u) as the intersections of the ellipsoid with all the hyperboloids of 
families (78) and (79). 


138. Gaussian curvature. We shall explain the geometrical signifi- 
cance of the Gaussian curvature. We take the lines of curvature as 
coordinate lines on a surface. Relationship (72) will be satisfied along 
all these lines, the coefficient a being one of the principal radii of 
curvature, as we have seen. This gives us the following relationships: 


r,+h,m,=0; r,+ Rm, =0. (81) 
With every point M of the surface we associate a point M, of the 


unit sphere, M, being the terminus of the unit vector m drawn 
from the centre of the sphere, where m is the unit normal to the 
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surface at M. This correspondence between points of a surface and 
points of a sphere is usually known as a spherical mapping of the 
surface. The position of the point M, will be determined by the same 
parameters u and v as determine M. Since the coordinate lines are 
the lines of curvature, we have: 


E=r?; F=0; G=r?. (82) 


The radius vector of the spherical mapping M, is, by definition, 
m, and the coefficients of the first Gauss form for the spherical 
mapping are, by (81) and (82): 


Ey =m? =- E; Fo=m, m, =0; (=m? =- (83) 
1 2 


We shall only stop to prove the middle equation, since the others 
follow directly from (81) and (82). Expressions (49) give M = 
= —ry ‘mj = — Tr; > mj. Since we have taken the lines of curvature 
as coordinate lines, M = 0, ie. ry > m; = r; > mọ = 0. On multiply- 
ing the first of equations (81) by mj; or the second by m4, we get 
m; * m; = 0. 

An elementary area of the original surface, and the corresponding 
element of the spherical mapping, are given by 


or, by (83), 
1 
dS, = TR, RJ ds , 

whence it is clear that the Gaussian curvature at a point M has an 
absolute value equal to the limit of the ratio of an area of the spherical 
mapping to the corresponding area of the original surface when the 
latter contracts indefinitely to the point M. This ratio obviously 
characterizes the degree of dispersion of the pencil of normals to 
the surface at points of the elementary area. 

We obtained in [134] an expression for the Gaussian curvature K 
in terms of the coefficients of the two Gauss forms. The expression 
for K given by Gauss himself was only in terms of the coefficients 
E, F, G and their derivatives with respect to u and v. This fact has 
an important consequence, which we must stop to consider. Let 
there be a correspondence between points of two surfaces (S) and 
(S,) such that corresponding points have the same values of par- 
ameters u, v. Each surface will have its own first Gauss form, expressing 
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the square of an element of arc. If the two forms are identical, it 
amounts to saying that lengths are preserved in the correspondence, 
or in other words, that the surfaces can be superimposed on each other. 
In this case, the coefficients E, F, G and their derivatives with respect 
to u and v are the same for both surfaces, so that the curvature K 
has the same value at corresponding points of the surfaces, i.e. when 
a mapping of one surface onto another preserves lengths, the Gaussian 
curvature has the same value at corresponding points. 

In particular, the Gaussian curvature is zero on a plane, and we 
must have LN — M? = 0 on a surface that can be superimposed 
on a plane without distortion of lengths, i.e. every point must be 
parabolic. We have already had the cone and cylinder as examples 
of such surfaces. 


139. The variation of an elementary area and the mean curvature. 
Let (u, v) be the parametric coordinates and r(u, v) the radius vector 
of a given surface (89). If we mark off along the normal m at every 
point M(u, v) of the surface a length MM, of algebraic value n(u, v), 
where n(u, v) is a function of u and v, we get a new surface (81) formed 
by the points M,. We shall represent points M, by the same parameters 
(u, v) as points M, and shall speak of a correspondence having been 
set up along the normals to (8) between points of (8) and (8,). The 
radius vector r (u, v) to the surface (S,) is by definition: r (u, v) = 
r(u, v) + n(u, v) m(u, v). We obtain on differentiating with respect 
to u and v: 


r =r) + nim -+ nmi; rY = ri + nm + nm. 


We now find the coefficients H,, F,,G, of the first Gauss form 
for (S,), on the assumption that the length n and its derivatives 
with respect to u and v are small so that second order terms may 
be neglected: 


By = (09)? = (r4 + nam + mg) - (£4 + ng m + nmi) = 
= ri? + Qn) (rym) + 2n(r; m4). 


The vectors r, and m are perpendicular and r, +m = 0, so that 
(47) gives E, = E — 2nL. Similarly, we find that F, = F — 2nM 
and G, = G — 2n N. Hence: 


E, G — F} = EG — F? — 2n(EN —2F M + GL), 
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or, by (67): 
E,G, — F? = (EG — F?) (1 — 4n H). 


If we take square roots, expand (1 — 4nH)!? by the binomial 
theorem and neglect higher powers of n than the first, we get: 


VE, G, — F} = VEG — F? (1 — 2nH). (84) 


On multiplying by dudv and integrating, we get an expression for 
the difference ôS between the areas of the neighbouring surfaces (8) 
and (S,) to an accuracy of second order terms: 


f j VE, G, — Fi du dv — J VEG— F? dudv = 
(S, 
=— | f 2nH VEG — F? dud (85) 
(s 


or 


ôS = — È ( 2nH d8. 
‘3 


The familiar problem of Plateau, of finding the surface of minimum 
area stretched on a given contour (L), is directly connected with this 
expression. It is easily seen that the mean curvature H must be zero 
on such a surface. If H were say positive on some part o of such a 
surface, we should obtain, by (85), on choosing a small n, also positive 
on ø and zero elsewhere, including in particular (L), a negative value 
for 68: 

ôS = — f f 2nH dg, 
(0) 
and the surface (8,) passing through (L) would have an area less 
than (9), which contradicts our original hypothesis. In view of the 
above, a surface of zero mean curvature is known as a minimal 
surface. 

The formula for differentiating an integral over a variable closed 
surface with respect to a parameter also follows from (84). Let the 
position of a variable closed surface be defined by a parameter A, 
and let its position be (S) for 2 = 2, and (S,), near (8), for À near A). 
We set up a normal correspondence, as above, between points M 
of (S) and M, of (S,). With this, n is a function of u, v, and å, which 
vanishes as an identity in u and v for A = Ay, i.e. 


nu, v, Ay) = 0. (86) 
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Further, let {(V) be a function of points of space which is indepen- 
dent of the parameter å. The value of the integral: 


I) = Wi f(M,) d8, (87) 


will depend on 4, since the form of the surface depends on the parameter. 
We find an expression for the derivative I’(A,). On multiplying both 
sides of (84) by dudv, we can write dS, = (1 — 2nH) dS, and (87) 
can be written as: 


(4) = J § hae z }(M,) 2nH a8. 


The domain of integration is now the original surface (S) and is 
no longer dependent on A, and we can use the ordinary rule for 
differentiating under the integral sign [80]. Let the point M of (8) 
correspond to the point M, of (S,), so that MM, = n(u, v) is normal 
to (8), i.e. has the direction m. Differentiation of f(M,) with respect 
to A gives at A= Ay: 


wy AM) — iM) ys FN) — M) MM, __ ƏM) | on 
age S ape ag o OS 


where m is the direction of the normal m. On noting that n vanishe 
for A = A, and writing 0n/9A, for the value of the derivative at 2 = A, 
we get: 


I’ (Ag) a a) ~ dis — [aoe a a: (88) 


Let the equation of the variable surface (8,) be given implicitly as: 
GM; A4)=0 or (x,y,z, A) = 0. (89) 


Differentiating with respect to A both directly and via M,, as in 
the case of the function f(M,), we get for A = Ay: 


Op(M, 20) pM, Ao) 2 ðn 
m Ba a T Y 


dA, 
If we find 3n/3å, from this and substitute in (88), we get the follow- 
ing expression for the derivative: 
we. 


oS ole en 
(9) oe 
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If the integrand f in integral (87) also contains the parameter å, 
an extra term has to be added on the right of (90), as was the case 


in [120], of the form: 
of 
f faras. 
(S) 


140. Envelopes of surfaces and curves. We introduced, the idea of 
an envelope of plane curves in [10], when studying the particular 
solutions of first order ordinary differential equations. In a similar 
way, the solution of partial differential equations leads us to the 
concept of an envelope of surfaces, a brief account of which now 
follows. 

Let us be given a family of surfaces with a single parameter, 


F (x,y, 2,a) = 0. (91) 


A definite surface of the family is obtained on fixing the numerical 
value of a. We consider a new surface (8) which also has the equation 
(91), but with variable a, found from the equation: 
ƏF(z,y,z,a) _ 
e Os (92) 
We can say that the equation of (S) is obtained by eliminating a from 
equations (91) and (92). If we take a fixed a = ay, on the one hand 
we get a definite surface (S,) of family (91), and on the other hand, 
the substitution of a = a, in (91) and (92) gives us a line (l) on the 
surface (8), such that (S) and (S,) have (lọ) in common. We shall 
prove that the surfaces have a common tangent plane along (J,). 
Since a is constant, the projections dz, dy, dz of an infinitesimal 
movement along the surface (91) must satisfy 
or or or 
Since a is variable for surface (8), we must write here: 


oF or 
ð 


OF 
-5z de + By dy + m dz+ ŻE da = 0. 


But this equation is the same as the previous one, by (92), i.e. 
infinitesimal displacements at a common point on (S,) and (8) are 
perpendicular to the same direction, the direction-cosines of which 
are proportional to: 


oF or oF 
da’ Oy’ oz”? 
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so that it follows that (S,) and (8) touch along (lọ). Thus in general, 
elimination of a from equations (91) and (92) gives the equation of the 
envelope of surfaces of family (91), with contact taking place along a 
line. 


Example. Let us take the family of spheres with centres on the z axis and 
radius r (constant): 
at +y? +(e — a)? = rè, 
We differentiate with respect to a: 
— 27 — a) =0. 
On eliminating a, we get the equation of the circular cylinder: 
x? + y? = r2, 


which touches each of the spheres in a circle. 


We now consider a family of surfaces with two parameters: 


F(x, y,2,a,b) = 0. (93) 
On eliminating a and b from this equation and the equations 
ƏF(x, y, z,a, b) _ h. OF (x, y,z,a, b)  __ 
Pegg a Sa: pee (94) 


it is easily shown that we get a surface (9), which touches the surfaces 
of family (93). But in this case, contact only takes place at a point, 
instead of along a line. In fact, on taking a fixed a = a, and b = by, 
on the one hand we get a definite surface (Sy) of family (93), and 
on the other hand, substitution of a = a, and b = b, in the three 
equations (93) and (94) in general gives a point M, on the surface (8). 
The point M, will be common to (S) and (8,). 


Example. Let us take the family of spheres with centres on the XY plane 
and a fixed radius r: 


(œ — a} + (y — 0) +2 = re 
We differentiate with respect to a and b: 
— 2(x —a)=0; — 2y —b)=0; 


Elimination of a and b gives us the equation 2? = r?, i.e. the envelope consists 
of two parallel planes z = +r, which touch each of the spheres at a point. 


The remark regarding the determination of the envelope of a 
family of curves [10] applies equally when finding the envelope of 
a family of surfaces: elimination of a from equations (91) and (92), 
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for instance, can lead not only to the envelope but also to the locus 
of singular points of the surfaces of family (91), i.e. those points at 
which the surfaces have no tangent plane. If the left-hand side of 
(91) is continuous and has continuous first order derivatives, every 
surface that touches the various surfaces of family (91) at all its 
points can be found by the above method of eliminating a from equations 
(91) and (92). In general, we omit proofs and precise conditions in 
this article and the next, and confine ourselves to giving the broad 
outlines of basic facts. 

We now consider a family of curves in space, depending on a single 
parameter: 


F(x,y, 2,a) = 0; F, (x,y, 2,a) = 0. (95) 


We shall seek the envelope of the family, i.e. the curve I’, every 
point of which is a point of contact with a curve of family (95). 
We can take I’ as also defined by equations (95), except that now a 
is variable instead of constant [10]. The projections dz, dy, dz of a 
small displacement along curves (95) must satisfy the equations 


SE. dx + = dy + os dz = 0; 
oFs dz + = dy ane a dz = 


Similarly, the projections 62, a ôz of a small displacement along 
T must satisfy the equations: 


om cea ôz + ôa = 0 


ôx -+ S ôy + ai 


ar, ar, 


bx + 28s y Y t 


6z + i ôa = 0. 


The condition for contact amounts to these projections being pro- 
portional, i.e. 


and this in turn, in view of the above relationships, is equivalent 
to the two equations: (0F,/ea) da = 0 and (ƏF,/ða) da = 0, or, if 
we take ôa # 0, i.e. a not a constant, we get the two equations: 


OF (x,y,z, a OF, (x,y, 2, @ 


da 


The four equations (95) and (96) do not in general define a curve, 
i.e. as a rule, a family of curves in space has no envelope. But if the 
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four equations reduce to three, i.e. one is a consequence of the others, 
the coordinates (x, y, z) will be defined by the three equations as 
functions of the parameter a, so that we get a curve in space, which 
is in fact an envelope [or the locus of singular points of curves (95)]. 
We have an example in the next article of a family of straight lines 
in space which has an envelope. 


141. Developable surfaces. We take the particular case of a family 
of planes with one parameter a: 


A(a)x + Bla)y + C(a)z + D(a) = 0, (97) 


The envelope (8) is obtained by eliminating a from the two equa- 
tions: 
A(a) « + Bla) y + Cla)z + D(a) =0 | (98) 


A’ (a)a+ B' (ajy + C’'(a)z + D'la)=0. 


With a fixed, these equations yield a straight line (la), and the 
surface (8) is the locus of these straight lines, i.e. (S) must be a ruled 
surface. It may further be seen that not every ruled surface can be 
obtained by the above method. The surface (8) touches a plane (97) 
along (la), i.e. (8) has the same tangent plane along the straight genera- 
tor (la). Thus a family of tangent planes on (S) depends only on the 
single parameter a, designating the generator (la). The family of 
tangent planes to a surface depends in the general case on two 
parameters, defining the position of a point on the surface. Let the 
equation of (8) be written explicitly: z = f(x, y), the partial derivatives 
of the function f(x, y) being denoted as in [62]. The first two direction- 
cosines of the normal will be functions of the single parameter a: 


= W, (a); = W, (a). 


(Sy: Ras aan ern se 
JETEN itp +¢ 
Elimination of a from these equations gives a relationship between 
p and q that can be written as: 
q = Y(p). 
This relationship must be satisfied over the entire surface (S) and we 


find on differentiating with respect to the independent variables x and y: 


s=' (p)r; t= g (p)s, 
whence 
ri — s3 = 0, (99) 
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i.e. all the points on a surface enveloping a family of planes with one 
parameter must be parabolic. 

The surface (S) is generated by the family of straight lines (98). 
It is easily seen that this family has an envelope; differentiation of 
equations (98) with respect to a gives the two equations: 


A’ (aja + B’ (ajy +0 (ajz + D(a) =0, | 


(100) 
A” (a)x + B” (a)y + C” (a) z + D” (a) = 0, 


and the four equations (98) and (100) reduce to three. We can therefore 
say that (8) is generated by tangents to the curve J’ in space. If the 
curve J’ degenerates to a point, (S) is a conical surface, whilst it is 
cylindrical if the point is at infinity. We prove the converse: given a 
curve J’ in space, 


x= plt); y= vt); z= w(t), (101) 


the surface (S) generated by tangents to T envelopes a family of planes 
with one parameter, these being the osculating planes of T. The family 
has in fact the equation 


A(X — 1x) + B(Y —y) + O(Z—z) = 0, (102) 


where (x, y, z) are given by equations (101) and A, B,C are given 
by expressions (31) of [126]. On differentiating (102) with respect to 
the parameter ¢ and using the fact that, by (31), 


Adz + Bdy +Cdz=0, (103) 
we get 


dA (X — zx) + dB (Y —y) + dC(Z —2z)=0, (104) 


where we write the differentials instead of the derivatives with 
respect to t. The enveloping surface of family (102) is made up of 
straight lines determined by equations (102) and (104), and it remains 
for us to show that these two equations give the tangent to I’ at 
the point (x,y,z). We differentiate (103) and note that Ad’x +- 
+ Bd?y + Cd?z = 0 by (81); we get: 


d4 dx + dB dy + dC dz=0. (105) 
Equations (103) and (105) show that normals to the planes (102) 
and (104), which pass through the point (x, y, 2), are perpendicular 


to the tangent to the curve I’, i.e. planes (102) and (104) both pass 
through this tangent, which is what we wished to prove. 
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We saw above that condition (99) is necessary for (8) to be the 
envelope of a family of planes with one parameter. It may be shown 
to be also sufficient. We also mentioned above [138] that (99) (or its 
equivalent LN — M? = 0) is necessary for (3) to be able to be mapped 
on to a plane without distortion of length. The converse can be proved, 
that if this condition is fulfilled, a sufficiently small portion of the 
surface can be mapped on to a plane by the method described. For 
this reason, the envelope of a family of planes with one parameter 
is known as a developable surface. 

Not every ruled surface is developable. For instance, if we take 
a hyperbolic paraboloid or a hyperboloid of one sheet, (99) is not 
fulfilled for these [137], in spite of the fact that they are ruled surfaces. 
It follows from this that, if a point varies along a straight generator 
of such a surface, the corresponding tangent plane rotates about the 
generator. 

The French mathematician Lebesgue carried out a detailed investi- 
gation of surfaces developable into planes, with very few assumptions 
regarding the functions appearing in equations (38) of the surface 
(we have assumed the existence of continuous derivatives up to the 
second order). One of his results was a developable surface con- 
sisting of a non-ruled surface of revolution. 


CHAPTER VI 


FOURIER SERIES 


§ 14. Harmonic analysis 


142, Orthogonality of the trigonometric functions. The harmonic 
oscillation 


y = Asin (wt + p) 
represents the simplest example of a periodic function of period 
T = 2x/w. We confine ourselves for the present to periodic functions 
of period 2x and let x denote the independent variable, so that the 
function y becomes: 
y = Asin (x + p). 
More complicated functions of the same period are given by 


A, sin (ka +- p,) (k =0,1,2,3,...) 


or by the sum of any number of these: 
n 
> Asin (kx + Pı) 
k=0 


this sum being known as a trigonometric polynomial of the n-th order. 
The question now naturally arises of the approximate representation 
of any periodic function f(x) of period 2x as a trigonometric polynomial 
of the n-th order, followed by the question of the expansion of f(x) 
into a trigonometric series: 


fæ) = > A, Sin (ke + pi); 
k= 


these problems are similar to the problems of approximating a func- 
tion by a polynomial of the nth degree or of expanding it into a power 
series. The general term of the above series, 


A, sin (kx + Pr)» 


397 
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is known as the k-th harmonic of the function f(x). It can be written 
in the form 
A,,sin(ka + Pr) = a, cos kx + 6, sin ka, 
where 
a,=A,sing,; by = A,cosy, (k= 0,1,2,...). 


The zero order harmonic, A, sin pọ, is simply a constant, which we 
denote by a,/2 in order to simplify later expressions. Our problem 
thus amounts to choosing, tf possible, the unknown constants 


Qo A, by, Ag, bg ..., By, Onse 


in such a way that the series 
a Te OE e enn) (1) 


is convergent to a sum equal to the given periodic function f(x) of period 2x. 

As a preliminary to solving our problem, we note a simple property 
of the sines and cosines of multiple angles. Let c be any real number, 
and (c, c +- 2) an interval of length 2x. It is easy to show that 


c+2n C+2n 
f cos ka dx = 0; f sin kedz =0 (k= 1,2,3,...) (2) 
c ¢ 


If we take say the first of the integrals written, the primitive 
for cos kæ is (1/k) sin kæ, and in view of its periodicity, its values are 
the same for x =c and g= c +- 2x, so that their difference is 
zero, i.e. in fact, 

CH+2n À = 
f cos ker da = She ere ag 
Š x=¢ 
Similary, by using the familiar trigonometric formulae: 
sin(k +1) æ + sin(k — I) æ 
2 


sin kx cos lx = 


cos(k — 1) x — cos(k + 1) x 
p ’ 


sin ka sin la = 


66a ieee cos(k + 1) x + cos(k — l) x 
2 Ed 
it can be shown that: 
c+2n cHon 
f cos kx sin la dz = 0; f cos kæ cosla da = 0; 
c e 
c+2n (3) 
{ sinkesinlede=0 (k#l). 


c 
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Let us take the family of functions 
l,cosz, sing, cos2z, sin2z,..., cosnz, sinnz,..., (4) 


the first member of the family being a constant equal to unity. 
Formulae (2) and (3) express the following fact: the integral of the 
product of any two different functions of family (4) over any interval 
of length 2x is zero. This property is generally known as the 
orthogonality of family (4) in the interval mentioned. We now find the 
integral of the square of a function of the family. The integral is 
evidently equal to 27 for the first function, whilst for the remainder, 


since 
1 + cos 2ka 
2 


1 — cos 2ka 


cos? ka = z , 


; sin? kg = 
we have: 


C+2x c+2n 

f cos? ka da = n; f sin? kxdgz =n (k=l, 2,...). (5) 
c c 

To avoid confusion, we shall in future take c = —2, i.e. the interval 
(c, c + 22) now becomes (—x, 7). 

We now turn to the problem posed above. Let the function f(x) 
be defined in the interval (—x, x), in which case it is defined for 
other values of x by virtue of its periodicity of period 27; and let 
us assume that it gives the sum of series (1): 


fe) = + > (a, cos ke + b, sin ker). (6) 
k=1 
On integrating both sides of this equation over the interval (— x, 7) 


and replacing the integral of the infinite sum by the sum of the 
separate integrals, we get: 


+n +a œ +r +a 
| fe) dz = f -E dz + Z (a f coske de + bk J sin ke dz] ; 
which, by (2), reduces to the equation: 
+7 
| Hx) de = m2 2n = ayn, 


whence we determine the constant a,: 


+a 
a = -y | fa) de. (7) 
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We now have to find the remaining constants. Let n be a positive 
integer; let us multiply both sides of (6) by cos nz and integrate, 
as above: 


f f(x) cos næ dx = -2 Toos na da + 
+ > (a, foe kx cos nz da + bg fin kx cos nx dz). (8) 


All the integrals on the right of the equation are zero by (2) and (3), 
with the exception of the one integral: 


+n 
f cos kz cosngdz with k =n, 


this latter integral being equal to x by (5). 
Equation (8) thus reduces to the form: 


+a 
f f(x) cos ng dz =a, n, 
whence 
+z 
a, = — | Næ) cosne dz (n= 1,2,...). (7) 


In exactly the same way, we can obtain the formula: 


+H 
n= = f fz) sin ne de (n = 1,2,...). (72) 


b 


It may be noted that (7) is the same as (7,) with n = 0. We can thus 
write: 


x) coskadz (k=0,1,2,...) 


it 
ant 
Ya 
Jio f(z) sinkæde (k=1,2,...). 


b =l 
b=- 


The above working is not rigorous and is only valuable as a guide. 
We have in fact made a number of unjustified assumptions: firstly, 
we assumed right at the start that the given function had the expan- 
sion (6), then we replaced the integral of the infinite sum by the sum 
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of the integrals of the separate terms, or as we say, integrated term 
by term, which is not always permissible [cf. I, 146]. 

The rigorous statement of the problem is as follows. Let a function 
f(x) be given in the interval (— x, x). We evaluate constants a, and by 
in accordance with (9) and substitute the values obtained in series (1). 
The question arises: will the series thus obtained be convergent in 
the interval (—x, x), and if so, will its sum be equal to f(x)? 

The coefficients a; and b; obtained from (9) are known as the Fourier 
coefficients of the function f(x), whilst the series obtained from (1) after 
replacing the a, and bp with their values as given by (9) is called the 
Fourier series for f(x). We state in the next article the solution of 
the above problem of the convergence of the Fourier series for a 
given function. 

Remark. Expressions (3) and (5) given above are valid for integration 
over any interval of length 2z. In general, if a function f(x), defined 
for all real values of x, has a period a, i.e. f(x + a) = f(z) for all z, 
the integral of f(x) over any interval of length a has a definite value, 
independent of the initial point of the interval, i.e. the value of 


c4a 

(Woks 

c 
is independent of c. The number c can in fact be written in the form 
c = ma + h, where m is an integer and h belongs to the interval (0, a): 


(m+!ja+ (m+Na 


eta h (m+1)a +h 

{ f(a) de = fa) da = f fade + fla) de. 

c ma+h ma+h (m+ ja 
We introduce a new variable of integration t = x — ma into the 
first integral, and ¢, = x — (m + 1) a into the second: 


c+a 


f" hæ) de = f f(t, + ma) at, + f flt, — (m + 1)a] dt, 
c h 0 


On taking account of the periodicity of f(x) and denoting the variables 
of integration again by x, we get: 


c+a a h a 
f fla) dx = f fla) de + f fla) de = f fla) dz, 
c h 0 0 
whence it follows that the integral does not depend on c. If f(z) has 


period 27, we can evaluate its Fourier coefficients a, and bp in accord- 
ance with (9) by integrating over any interval of length 2z. 
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143, Dirichlet’s theorem. The Fourier series for a function f(x) will 
be convergent and its sum will be equal to f(x) provided certain 
restrictions are imposed on f(x). We suppose firstly that f(x), given 
in the interval (—-x, x), is either continuous or has only a finite 
number of points of discontinuity in the interval. We further assume 
that all these discontinuities have the following property: if x = c 
is a point of discontinuity of f(x), there exist finite limits for f(x) 
as x tends to c, both from the right (from larger values) and from 
the left (from smaller values). These limits are usually written as 
f(c + 0) and f(c — 0) [I, 32]. Such points of discontinuity are generally 
known as discontinuities of the first kind. We finally assume that 
the total interval (—x, x) can be divided into a finite number of 
parts such that f(x) varies monotonically in each. The above are 
generally referred to as Dirichlet conditions, i.e. we say that a function 
satisfies Dirichlet conditions in the interval (—x, x) if it is either con- 
tinuous in the interval or has a finite number of discontinuities of the 
first kind, and if, furthermore, the interval can be divided into a finite 
number of sub-intervals in each of which f(x) varies monotonically. 
At the end x = —xz, we are only interested in the limit to which 
f(x) tends as x tends to (— x) from the right, so that we shall write 
f(—a + 0) instead of f(— xz); and similarly, instead of f(a) we write 
f(x — 0). We remark that these limits can be different, but the sum 
of series (1) must of course be the same for x = —az and x= x, 
due to the periodicity of functions (4). 

The following theorem is fundamental as regards the theory of 
Fourier series: 

DrricuizT’s THEOREM. If f(x) is specified and satisfies Dirichlet 
conditions in the interval (—x, x), the Fourier series for the function 
is convergent throughout the interval and the sum of the series: 

(1) is equal to f(x) at all points of continuity of f(x) lying in the 
interval ; 

(2) is equal to 

ee nie 


at all points of discontinuity ; 
(3) ts equal to 


f(— 7 +0) + f(a —9) 
2 


at the ends of the interval, i.e. for x = —x and x = +n. 
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The proof of this theorem will be given at the end of the present 
chapter. 

Certain points may be noted in regard to the statement of the 
theorem. The terms of series (1) are periodic functions with period 22. 
Hence, if the series is convergent in the interval (—z, 2), it is likewise 
convergent for all real values of z, 
and the sum of the series periodi- 
cally repeats, with period 2x, the 
values that it gave in (—a, 7). 
We must therefore assume, if we 
wish to use the Fourier series 
outside the interval (—x, x), that 
the function f(x) is continued out- 
side with a periodicity of period 
2x. The ends of the interval z = +% are from this point of view points 
of discontinuity of the continued function, if f(—2 + 0) Æ f(z — 0). 

A function continuous in (—z, x) is illustrated in Fig. 114, which 
gives discontinuities on periodic continuation due to having different 
values at the ends of the interval. 

The following lemma is often useful when calculating Fourier 
coefficients: 

Lemma. If f(x) is an even function in the interval (—a, a), i.e. f(—x) = 
= f(x), we have 


Fic. 114 


$F) de= 2 f f(x) dz, 
whilst if f(x) is an odd function, i.e. f(—x) = —f(æ), we have 
Ji (x) da = 0. 


The proof of this lemma was given earlier [I, 99]. 


144. Examples. 1. We expand v as a Fourier series in the interval (— x, 7). 
The products x cos kx are odd functions of x, so that all the coefficients a, 
are zero by (9). On the other hand, the products x sin kz are even functions, 
and the coefficients by may be evaluated from the formula: 


2(—1) 


k 


d= È f esin kode Z f Zeot 
0 


m 
+ = feos kx az} = 
0 


x=0 
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The graph of the Fourier series is drawn with a full line in Fig. 115, and it 
is clear from the figure that we have discontinuities at x = + x, the arithmetic 
mean of the limits from the left and right being evidently zero. Dirichlet’s 
theorem thus gives in the present case: : 


9 (Se inte (— 1)*-1 sin ka ) B k for —rz<a<272 


IT A Post k O for == + m. ` (10) 


Fie. 115 


2. We do the same for x?. Here, the x? sin kx are odd functions and all the 
by are zero. We calculate the aç: 


27 222) Ont 
x3 7 
= — 2 x — ——— = 
Qy z E da x3 373 
0 x=0 
n x= 
ig 
a= =f at costado = = [eR he sin k) Es 2 f esinke aa} = 
n n k 
0 x=0 0 
ke) af 4 
= 1 Jecoske| If =(— pk 
aE | y | cos ke àa) (—1) FE 
0 


It is clear from Fig. 116 that the graph of the Fourier series has no discon- 
tinuities, and the sum of the series is equal to x? throughout the interval, includ- 
ing its ends: 


cos ka 


ae a E (=a <<a). Q1) 
On setting x = 0, we get: 
1 1 1 k—ı l _ 
Le ta ptet aes Sit (12) 
If we put 
| 1 
Bae hag Pg eo 
re | 1 
Sec ie, Ek = 13 
ibe ttt o1; (13) 
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we evidently have 


1 1 1 1 3 
Hat Ttigtst: =O +7” = 79, 
and equation (12) gives 
1 1 1 1 1 n? 
ee nS ee ar ge age 
i.e. 
1 1 l m 
Se A a ge 
1 1 1 n? 
Gee Dar rae toot aapi +... =: 


~5X -4X -3X -2X -X 0 X 2K 3x 4 SH 


Fie. 116 


3. We expand as a Fourier series the function: 


TER ie 
¢,for O<a<ca. 
We have here 
+2 0 x 
a= > f(e)de= =| fede t fede] =o tes 
—n -7 0 


+ 0 
a, = | Fi) cos keda = =| fes cos ke de + 


—n ax 
+n 
+ i c, cos ka da = 0, 
0 
+n 0 a 


w=} [ræ sin te ae =| feisin ke de + f casin ke de] 
0 


Š = 
(—1*—1 
ak , 


= (c; — ca) 


405 


(14) 
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i.e. by = 0 for even k and by = — 2(c, —c,)/ak for odd k, so that by Dirich- 
let’s theorem (Fig. 117): 
Cy for —-a<a2<0 
ote 2 (cy — e sing , sin 3x c for O<e<cn 
utes ni 2) $ i + 5 +...]= 2 (15) 


Sit 8 fon a2=Qandin. 


Fie. 117 


145. Expansion in the interval (0, 2). We have simplified the 
evaluation of the Fourier coefficients in the above examples by 
making use of the evenness or oddness of the expanded f(x). 

In general, on applying the lemma of [143] to integrals (9) defining 
the Fourier coefficients, we get: 


a= =f f(2) coskxda; b, = 0, (16) 
0 
if f(z) is an even function, and 
2 n : 
a, = 0; b= |F (æ) sin ka dy, (17) 
ò 


if f(z) is odd. The actual expansion of the function will be of the 
form 


B+ Sa, cos ke, (18) 
k=1 
if f(x) is even, and 
>}, sin ka, (19) 
k=l 


if f(x) is odd. 

Now let any function f(x) be given in the interval (0, x). It can 
be expanded in (0, x) either in a series of the form (18) containing 
only cosines, or in a series of form (19) containing only sines. The 
coefficients are evaluated in accordance with formulae (16) in the 
first case, and in accordance with (17) in the second case. Both series 
have a sum inside the interval equal to f(x), or to the arithmetic 
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mean at points of discontinuity. Outside (0, x), however, they repre- 
sent quite different functions: the cosine series gives a function obtained 
from f(x) by even continuation in the neighbouring interval (—z, 0), 
followed by periodic continuation with period 27 outside the interval 
(—a, x). The sine series gives the function obtained by odd continuation 


Fie. 118 


in the neighbouring (—2, 0), followed by periodic continuation with 
period 2x outside (—z, 7). 


x series in cos series in sin 
0 K+0) 0 
n ja — 0) 0 


Thus in the cosine expansion: 


f(— 0) =f(+ 0); 
f(— x -+ 0) = f (x — 0), 


Fra. 119 


whilst in the sine expansion: 
f(— 0) = —f(+ 9); 
f(— x +0) = — f (x — 0). 


Correspondingly, we obtain at the ends of the interval the values 
shown in the table for series (18) and (19). 
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Figures 118 and 119 illustrate the graphs of the functions represented 
by series (18) and (19), derived from the same function f(x) in the 
interval (0, 7). 


Ezamples.1. We obtained in examples 1 and 2 of [144] a sine series for 
the function v and a cosine series for x? in (0, x). On expanding v as a cosine 
series in (0, x), we get: 


w% n 
r=- a+ D % 008 ke; a= [edema 
k=l 6 
2 7 2 0 for even k 
=> = — k = 
a, = = | 2008 ke de = 3 [(— 1)* — 1] eer 
0 nk? 
y 


“40 -IT -27 -X 0 x 2x 5K 4X 
Fie. 120 
Hence: 
_% 4 (cose cos 3x cos (2k + 1) a 
2-5 -E (5 bet IF +) (20) 
(0 <x <n). 


The sum of the series on the right will amount to (—2) in the interval (— x, 0), 
i.e. the sum gives the absolute value |x | throughout the interval (— x, x): 


(21) 


n 4 (coss cos 3x cos 5a 
(SS t-a tga +) 


whilst outside this interval it gives the function obtained by periodic repetition 
of | x | from (— x, x) (Fig. 120). The sine expansion of x? in (0, 2) gives: 


> k-1 k 
bp = [at sin ke de m I 4[(— 1) —1j 
0 


k nk? 
and 
= sin % =. sin 2a sin 3x = ~= (44 sin 3x 
at = 2a SO =a 2 ines e in + 35 + 
in ba 
+a te. 


in the interval 0 < v < x (Fig. 121). 
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We suggest that the reader prove that we can split the Fourier series obtained 
into two series, as has been done above. 


2. The function cos zz is even in regard to x, so that it can be expanded 
in cosines in the interval (— 2, 2): 


co % 
a, 2 
cos zz = —2 + > 4, cos kz; ak = — f cose cos ke da. 
2 n 

k=1 ó 


—-—~~_---W 


Fic. 121 


We have: 


x=» 2Qsin nz 


x=0 m ° 


n 
2 2 sin za 
a, = — | cos za da = — 
n m z 
0 


7 n 
ak = = feos 2x coska de= = f [eos (2z + k) x + cos (z — k) x] dr = 
0 0 


_ Lf sin(gg +h)a sin (z — k) x ]x=2 
== z+k + z—k ] 


el [mtm + et j- 


zu zt k z—k n (z3 k?) 
Accordingly, in the interval — z < v < a: 
gosar eaen | l | cose _ cos2¢e , cos3e 
alge te a t 


On setting x = 2 here and dividing both sides of the equation by sin nz, 
we have: 


cot mz = [2 : a (22) 


z k?— z J 
k=1 
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This formula is known as the decomposition of cot nz into partial fractions. On 
differentiating with respect to z, dividing by 2 and reversing the sign, we get 
the decomposition of l/sin? xz into partial fractions: 


1 1 <S k%+22 
sin? zz = lat = (k2? — aaa | 
or, noting that 


k? +22 1 1 
(2) +h EOF 


we can write the above in the more symmetrical form : 


1 


a a 1 
sin? az a, =k’ (23) 


Expression (22) leads to a striking expansion of cot z into a power series. 
On multiplying both sides by zz and replacing xz by z, i.e. z by z/, we get: 


S 222 
zcotz=1— PA Penz 


But 


22% E 222 
CEET vel- 


aa) 


z2 z2 za gen 
=2 (lt gr t par tet pega t+) (e< m. 


On substituting this in the above formula and expanding in powers of z? 
we have 
Pr | oe | 
zcotz = 1 — 2-7 DE E — La A S a D — 


Substitution of z/2 for z gives us: 


I Eose a E a 1 Joan 
yz cot 52 =1— Dle Žž, or | i 


n=1 
We shall denote the coefficient of z2?” by Bp/(2n) t: 


1 Ty se B, r B, a Ba = 
z7 eot- =l L-a — -2 Sai nji 


22n S 
Ba= aa À, E: a 
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The first of the B, can be found without difficulty by direct expansion of 
(z/2) cot 2/2, on writing this as the quotient of the series for cos z/2 divided 


i 2 
by the series for = a [I, 130]: 
1 l I l 5 
B= Bsgo Page Bera B= 


6 


and it is clear at once that the B, are rational numbers. They are known as 
Bernoulli numbers. On the other hand, knowing their values, we can find the 
sums of the series: 


œ =o (22)"" Br E 
2 n a OSD) 


Occasionally, instead of Bernoulli numbers, we take the Eulerian numbers, 
defined by the expressions 


1 — 1}? B 
4 =l; 4=--7 S Ay =k 
Anm =0 (k=1, 2, 3,...)s (25) 


If we replace z by ¢/¢ in equation (24), the fact that 


t 1 1 

a >t ->t 
t ott OS t eè +e? __ t PE 
2i 2% 2i t ae aie l,  e&=1 2” 

an gy e? —e ? 

leads us to 
baa t B, B, iA ior Bri” 

Ca ge ep ge ED Seay 


=A,+A,t+A,2+ A+... 


Bernoulli and Eulerian numbers are often encountered in various branches 
of analysis. 


146. Periodic functions of period 2l. It often becomes necessary to 
expand into a trigonometric series of cosines and sines a function defined 
in an interval (—1, l) instead of (—a, x); or alternatively, to expand 
in cosines or sines only a function defined in (0, J). 

This problem reduces to the above with the aid of a change of 
scale, i.e. by introducing in place of x an auxiliary variable & in 
accordance with 


pa EA (26) 
We put 
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If f(x) is defined in (—l, l), y(&) is defined in the interval (—z, 7) 
of variation of &. 
We obtain on expanding ¢(é) into a Fourier series: 


40 4. $ (a, cos kë + bysin kê), 
k=1 


where, by (26): 


a, = + foe cos ké dé = af f(-E) cos ke ag = 


+ 
= {(x) cos 


; dz; (27) 
—l 


krx 
l 
+l 
kax 


b= f(x) sin —— da. 
=I 


It follows that Dirichlet’s theorem remains valid for an interval 
(—l, 1), except that expansion (6) is replaced by the expansion 


krx 


to < . knas 
d + 5 (2008 +b, sin 7) (28) 


the coefficients a, and bp being defined in accordance with expressions (27). 
The same applies to the cosine or sine expansions of a function 
f(x) given in the interval (0,1), the series obtained being 


I 


3 + Sa,cos = ; a, = +- | f(x) cos Erz dz (29) 
k=l Fy 
and 
I 
X bsi  ; be =$ | Aa)sin = dz. (30) 
kal å 


Example. We find the sine expansion of the f(x) defined by: 


sin- for 0 <s<- l 
f(z) = 


0 forl <a<il. 


We have in this case: 


I 
by = -$ fræ sin 
ò 
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since the integrand vanishes in (?/2, 1). Simple working, which we leave to 
the reader, gives: 


0 for odd k>1 
l k 
by = T 
(—1)? 2k 
me? T) for even k 
1 
ar ae 
so that 
. AL 1 
sin -y> for 0< e<- 
1 
1. xæ 4 a (~1In . anne 0 for zr! <a<l 
ai ara oa aa i (81) 
z for oat 
0 for z=Oor l. 


The interval (—1, l) can be replaced by any interval (c, ¢ + 24) of length 
2l, as already mentioned as regards intervals of length 2x. With this, the sum 
of series (28) gives f(z) in the interval (c, c -+ 21), and in evaluating the coef- 
ficients from formulae (27) the interval of integration (—I, l) has to be replaced 
by the interval (c, c + 21). 


147, Average error. The theory of Fourier series may be approached 
in a different manner. As above, let f(z) be given in the interval 
(— x, 2). We form a linear combination of the first (2n + 1) functions 
of family (4): 


a, m3 ; 
A + = (a, cos kx + B, sin ka), (32) 
where ao, di, Bis .--, An, Ên are numerical coefficients. The expression 


written is generally known as a trigonometric polynomial of the n-th 
order. We consider the error resulting from taking the sum (32) for 
f(x), i.e. we consider the difference: 


a 


a = (a, cos ke + B, sin kz) 


4, (#) = f(a) —{ 


The greatest deviation A, of sum (32) from f(x) in the interval 
(—2, x) is defined as the greatest value of | 4,(z) | in the interval. 
The smaller 4,, the more accurately the nth order trigonometric 
polynomial (32) represents f(x). The quantity 4, is not suitable as 
a measure of the approximation, however, not only because the 
investigation of its value is difficult, but also because it is often 
more important in problems concerning the approximate representa- 
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tion of functions to achieve a reduction in the “average” or “probable” 
error rather than a reduction of the ‘‘greatest deviation”. Figure 122 
illustrates two approximate curves (dotted) for a given f(x) (full 
curve). The greatest deviation of curve (1) is less than that of curve (2), 
yet in general (1) differs far more from the true curve than (2) does; 
the deviations of (2), though considerable in the interval (—z, 2), 
are of much smaller duration than those of (1). 

When applying the method of least squares to checking the accuracy 
of a series of observations, we make use of the average or “root mean 
square” error, this being defined as follows: 
let the values obtained in measuring a quan- 
tity z be 

Zis 2a). © es ZN5 
the error of each measurement is 


2 — žk (k=1,2,...,N); 


the average error 6, is, by definition, given by 


Fie. 122 


62 1 y 2 
B= Sle — 4), 


k= 


~ 


i.e. ôn ts the square root of the arithmetic mean of the squares of the 
errors. 

It is this average error that we take as a measure of the degree 
of approximation of sum (32) to our function f(z). We only need to 
bear in mind that it is a matter of an infinite set, and not a finite 
number, of values, continuously distributed throughout the interval 
(—x, z). Each separate error is thus in fact A(x), and the arithmetic 
mean of their squares will be 


+n 


1 
sax | 43 (a) de, 


-7 


whilst the average error ôn of expression (32) is obtainable from the 
equation: 


+n 
= 5 f 4 (e)dr = 


=o i [æ — sin kæ}? dz. (33) 
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It now only remains for us to select the constants ao, a,, fr ..., 
an, Bn in such a way as to get the minimum 6%, i.e. we have the ordinary 
problem of finding the minimum of the function 62 of (2n + 1) 
variables. 

We first of all simplify expression (33) for 67. We find on squaring: 


P= 


= [fla]? — ay fla) — 2 X (a, cos ke + By sin kæ) f(a) +È + 
kal 


fia) 


+ X (a? cos? ka + b? sin? kx) + Op, (34) 
k=l 


where o, denotes a linear combination of expressions of the form: 
coslacosmaz, sinlxsinmz (14m), 
cosxsinmaz, coslz, sin mz. 


By the orthogonality of the trigonometric functions [143], the integrals 
of all these expressions vanish over the interval (—n, x), whence it 
follows that the integral of o, over the interval vanishes. The inte- 
grals of cos? kx and sin? kæ are equal to x, as we know, and we get 
on substituting (34) in (33): 


+n +n 
d= | Maeda — $e f f(a) de— 


-+ Saf scone +B Fe ) sin ke dar] + 


k=1 


+345 PACAL] 


We use expressions (9) for the coefficients of the Fourier series for 
f(z) to rewrite the expression for ôf as follows: 


+f ver de — 20% S (apay + bebi) + 


k=l 
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or, on subtracting and adding the sum 
a l Sm Lp 
4 + 9 > (ai + bk), 
k=1 


we can write our expression as: 


n 


on = -JV (x)? de — É — PACER) +4 (dy — a)? + 
-+ = 5 [(a, — a)? + (Bk — b,)2]. (35) 


The value of 6% is clearly a minimum when the last non-negative 
terms on the right-hand side vanish, i.e. when a, = a, and generally, 
ak = a, and fk = by (k = 1, 2, ...). It follows that the average error 
of the approximation to a function f(x) by means of an n-th order trig- 
onometric polynomial is a minimum when the coefficients of the poly- 
nomial are the Fourier coefficients of f(x) 

An important point must be noted. It follows from the result 
obtained that the values of ay and p leading to minimum 6° do not 
depend on the subscript n. If we increase n, we must add new coeffi- 
cients a, and fp; but those already calculated remain the same. 

The least error en is found from (35) by replacing a, and fk by 
a, and bp respectively: 


+n 2 n 
aay | lope ges 7 2 (ak + Bb, (36) 
or 
+x j n 
2eh=— | Pepa- $- > (ak + bp. (37) 


On increasing the order n of the trigonometric polynomial, new 
negative (or at any rate, not positive) terms, —an4,, —baay,, .--, are 
added to the right-hand side of (37), and therefore the error en can 
only diminish on increasing n, i.e. the accuracy of the approximation 
increases (does not decrease) on increasing n. 

The quantity ef is given by (33) if the a,, fx in it are replaced 
by apy, by respectively, ie. is given by the integral of the square of 
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a certain function, so that eż is indeed positive or, more precisely 
not negative. On taking this into account, we get by (37): 


2 n +x 
zt Sat) <= Sae. (38) 


We have so far made no explicit assumptions regarding f(x). It is 
necessary for the above arguments that all the integrals employed 
should exist, i.e. that the Fourier coefficients should be calculable 
in accordance with formulae (9) and that the integral of the square 
of the function exists. We shall assume for definiteness that f(x) is 
continuous or has a finite number of discontinuities of the first kind. 
All the integrals concerned certainly have a meaning with this 
assumption [I, 116]. We could in fact make far more general assump- 
tions regarding f(x) and in any case, those that figured in the Dirichlet 
conditions have no actual part in the above and future arguments. 
To return to inequality (38): as n increases, the sum of the positive 
terms on the left increases (does not diminish), whilst remaining less 
than the definite positive number occurring on the right. It follows 
immediately that the infinite series 


= (aj + b2) 


is convergent [I, 120]. On letting n tend to infinity and passing to 
the limit in (38), we get: 


2 = ae 
H+ Sak +o <= | fm] ae. (39) 


On recalling that the general term of a convergent series must tend 
to zero as we move away from the initial term, we can state the 
following theorem: 

THEOREM. With the assumptions made regarding f(x), its Fourier 
coefficients a, and by tend to zero as k —> +. 

The following is a fundamental problem from our new point of 
view: will the error £e, tend to zero with indefinite increase of n? If we 
pass to the limit with n increasing indefinitely on the right-hand 


n 
side of (37), we get instead of the finite sum = the infinite sum 
œ k=1 
> ie. 
k= 


lim 26% = + fvope-4 =P (az + b), 


418 FOURIER SERIES [148 


whence it follows that the fact that «, tends to zero is equivalent to 
our taking the sign of equality in (39), i.e. 


+a $ 
=| V@par= É+ Fat t. (40) 


This is generally known as the closure equation. We show in the next 
section that ¢,— 0, i.e. equation (40) is in fact valid for all functions 
/(x) with the above-mentioned properties. 


148. General orthogonal systems of functions. Most of the discussion 
of this chapter is based on the orthogonality of the functions of 
system (4) and not on the properties per se of the trigonometric func- 
tions. The discussion is on this account applicable to any system of 
orthogonal functions. Such systems are of frequent occurrence in 
mathematical physics, as we shall see. Let a system of real functions 
be given in the interval a < æ < b, the functions being assumed 
continuous for the sake of clarity: 


Pı (x), P2 (x), PAI Pn (x), eee (41) 


We shall suppose that none of these functions is identically zero. 
The functions of system (41) are said to be orthogonal if 


b 
Í Om (&) Py (x) dx = 0 with mn. (42) 


The integral of the square of each function of system (41) is equal 
to a certain positive constant. We introduce the following notation 
for these constants: 


b 
k, = J [pn (x) ]? da. (43) 


If we multiply each of the (x) by the respective numerical factor 
1//k,, we obtain new functions 

(2) =F (2); Ya (2) = = Pa (2); 
Yı VE, *2 > Y2 Vi, TFET 


Ya (2) = y Pa (2); a 
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which, by (42) and (48), satisfy not only the condition for orthogonality, 
but also the condition for the integral of the square of a function 
to be unity, i.e. 


b 0 for mn 
{Ym (22) Yn (2) de = | 4 (44) 
i 1 for m=n. 
The functions 
Y1(%), Pa (L), -> Pa(Z),-.. (45) 


of a system are said to be orthogonal and normalized if they satisfy 
condition (44). Let f(x) be a function defined in the interval (a, b} 
and let us suppose that it can be expanded in the interval in a series 
of functions (45): 


= See, (x), (46) 
k=l 


where the c, are numerical coefficients. We multiply both sides of 
(46) by p,(z) (n = 1, 2,...) and integrate over (a, b), making the 
assumption that the series on the right can be integrated term by 


term: 
b 


o b 
J f (2) Pn (x) da = > oe YK (x) Yn (x) dz. 
k=1 aqa 


a 


On taking (44) into account, we get the following expressions for 
the coefficients c,: 


Peat, (x) p, (£ (n= 1,2,...). (47) 


a 


The cn defined by these expressions are usually called the generalized 
Fourier coefficients of the function f(x) with respect to the system of 
functions (45). The above remarks are only of an introductory nature, 
as in [142], and a rigorous treatment implies the following problem: 
if the coefficients c,, calculated in accordance with formulae (47), 
are substituted in the series on the right-hand side of (46), will the 
series be convergent in the interval (a, b), and if so, will its sum be 
equal to f(x)? The solution of this problem naturally implies making 
certain assumptions regarding the properties of f(x). The series obtained 
by substituting for the c, their values from (47) is usually known at 
the generalized Fourier series for f(a). 
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We can take an alternative approach by writing an expression 
for the average error in representing the given f(x) as a finite sum 
of the form: 


n 
D YkYk (x). 
k=1 


The square of the error will be given by: 
1 2 < 2 
= gir [FO - Eroa) de 
a 


If we take (44) and (47) into account and carry out working similar 
to that of [147], we get: 


(b —a) 8 = Papi- a+ $ a 


Hence it follows at once that 6; is a minimum when the y, are 
equal to the Fourier coefficients of f(z), and denoting the minimum 
by £n, we have 


(b—a)e =f mar- Seb 


Hence the convergence follows, as above, of the series 


oo 
Sà 
k=l 


and we have the inequality 
b 
ža< f [/ (x)]? dx, (48) 


which is generally known as Bessel’s inequality. A basic problem here 
is whether ¢, tends to zero on indefinite increase of n, this being 
equivalent to having the equals sign in (48), i.e 


fi [f (x) ]? da = 5 2. (49) 
a k=1 


This is known as the closure equation for f(x) with respect to the 
system of functions (45). The system is said to be closed if equation 
(49) is valid for any continuous function f(x) and for any function 
with a finite number of discontinuities of the first kind. It may be 
mentioned that if this is the case, it can be shown that (49) is valid, 
in fact, for a far wider class of functions. 
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The proof of the closure equation was given for various systems 
of orthogonal functions by V. A. Steklov, who pointed out the 
importance of the equation in the theory of orthogonal functions. 
A. M. Lyapunov first proved the equation for the case of trigono- 
metric series. 

We return to the system: 


l, cosx, sing, cos 2x, sin 2x, ..., COSng, sinnz,.. 


These functions possess the property of orthogonality in the interval 
(—x, x) but they are not normalized, i.e. the integrals of their squares 
are not equal to unity. It follows from the discussion above [142} 
that the normalized orthogonal system is here 


a, J cosa, +-sing, ... 
Yin’ yx ’ Yn sa 

-L cosna, — sinna 

yx "Yn S 

There is a simple geometrical analogy to the above. We take ordinary 
three-dimensional space and let A be a vector with components 
Ax, Ay, A, along Cartesian axes. The square of the length of the vector 
is given by [103]: 

| A |? = A? + 43 + 42. (50) 


If we take two vectors A and B, the condition for them to be per- 
pendicular is [103]: 


A,B, + Ay By+ 4,B,=0. (51) 


We now consider a far more complicated vector space: we take 
as a vector every real function f(x), given in the interval (a, b), and 
possessing certain general properties similar to those discussed in 
previous articles, which enable the necessary integrations to be 
carried out. 

By analogy with (50), we take the magnitude of the integral 

b 


f Ff dx 


a 


as the square of the length of the vector f(x) of our function space, 
and by analogy with (51), we say that two vectors /,(x) and f(x) 
of the space are perpendicular or orthogonal if 


(fr (2) fe (2) dr = 0. 
b 
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We have here replaced the finite sums of (50) and (51) by integrals 
over (a, b). Using this terminology, we can say that condition (44) 
is equivalent to the fact that the vectors y(x) appearing in family 
(45) consist of parallel pairs and are of unit length, i.e. vectors y,(x) 
in our function space areanalogous to a system of mutually orthog- 
onal unit vectors of ordinary space [102]. Let f(x) be any vector 
of the function space. We can say that the c, evaluated from (47) 
are the components of the vector f(x) along the fundamental set y,(x). 
Bessel’s inequality (48) is equivalent to the fact that the sum of 
the squares of the components does not exceed the square of the 
length of the vector itself. If we take a fundamental set of three 
mutually orthogonal unit vectors in three-dimensional space, we 
always have the sign of equality, by (50). But if we forget about 
the third unit vector, for instance (directed along the z axis), we 
now have to write instead of the equals sign: 


AB + AB < [AF 


the equality being only applicable to vectors lying in the XY plane, 
whilst a strict inequality applies for the rest. An infinite set of 
mutually orthogonal fundamental vectors exists in functional space 
and no simple verification is possible of the fact that none have been 
passed over. 

If condition (49) for closure applies for all f(x), i.e. for all vectors 
of the functional space, the condition, which is analogous to (50), 
also acts as a test for the fact that no fundamental vector is left out, 
i.e. for the fact that no new fundamental vector y(x) can be added 
to system (45), orthogonal to all those already present. Let such a 
y(x) in fact exist, i.e. 


f yo (2) yp (2) de =0 (n=1,2,...). 


It follows from this, by (47), that all the Fourier coefficients of y(x) 
with respect to functions (45) are zero. By hypothesis, (49) must be 
valid for all f(x) and, in particular, for y,(x). But all the cn are zero 
for the latter, and (49) gives: 


f [vo (x)]? da = 0. (52) 
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If it is assumed say that y(x) is continuous, it follows from (52) 
that y(x) is identically zero in the interval a < x < b. The system 
(45) of orthogonal functions is said to be complete with respect to 
continuous functions if no continuous function exists, apart from 
those identically zero, orthogonal to all functions (45). It follows 
from what has been said that closure implies completeness with 
respect to continuous functions. The ability to obtain closure 
from completeness is bound up with a more general definition of the 
latter (not only with respect to continuous functions). 


149. Practical harmonic analysis. The operation of expanding a given function 
in a Fourier series is called harmonic analysis. If the function f(x) is given 
analytically, the problem is solved by means of formulae (27), which define 
the Fourier coefficients. In most practical cases, however, the function is given 
empirically, and the task of harmonic analysis is then to work out the most 
suitable methods either for evaluating the Fourier coefficients or for obtaining 
directly the harmonics of various orders in the given function. 

Computational methods of harmonic analysis are based on the application 
of approximation formulae for integrals to the integrals for a, and by. The 
rectangle formula is the simplest of these [I, 108]. 

We shall take an interval of length 22, as is always possible by a suitable 
choice of scale on the 2 axis. We take x = 0 and x = 2x as the ends of the 
interval. We now divide (0, 2) into n equal parts and denote the abscissae 
of the points of subdivision by 


Ly = 0, a, Lg, -> -s py, Ty = 2M, 
the values of f(x) at these points being denoted by 
Far Vay: Yigg sien neb Xn 


We now have by the rectangle formula: 


ghal e RS 
ak ~a Y, cos kaj; on È Y; sin ka, (53) 


and the various methods for evaluating the coefficients a, and b, have the aim 
of simplifying formulae (53) and reducing to the minimum the number of 
multiplications necessary. 

The method below, which is borrowed from W. Lohmann,} is based on 
certain transformations of formulae (53) and is extremely convenient both on 


+ Harmonische Analyse zum Selgstunterricht, Berlin, 1921. 
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account of the simplicity of the manipulations it requires and due to the high 
degree of accuracy of the results. 

1. We take the graph of the curve to be analysed and draw the abscissa 
as close underneath it as possible (Fig. 123) in order to avoid both negative and 
very large ordinates. We divide the period of the curve into twenty equal parts. 

2. We make up a table on a sheet ruled into squares as shown in Fig. 124. 
The numbers 1, 2, ..., 20 in the first column denote the abscissae, whilst those 
in the second column denote the corresponding ordinates of the curve, obtained 
directly from the graph. It is useful to take 
the scale of the graph small enough for the 


ordinates to be integers. l 2 1796! 7863501716159 


f 3 “ao tesco 3 


i CAIO | 


a Period 5 im 8176 | = 
F -=--> 
6 1205 

260 Ta E S ERL 
a 17 EEA aia F 
22 A E 0358 159 {301 

| l ' 
zor ft i | 19 182178 108 148 25 L 

É | 

of 1 1 | cay 187169 50 | 126! 
760 t | i 17 182 = —-b 1 

I Foi lo 1 

Mi 112184161 182 198179 | 
obi tt | 1 12 1841 67 152 138119 | 
120 ! ! LBI 20128123177) EAN = 
100 ry EAIC AEREN 

arati 1151 15 | v4 ii 9 15 
80 l i LEH PA =4- ad 
Soc Bas 0 aa 
oL ITELE] | 171110 11051 89165133 1 
an ace | 18 E680 | 99147] 

FESgceeee 119 774785 1411109152, _ 
128456769 101 121314151617 18192021 | 2011731184140 102152 | 
Fie. 123 Fie. 124 


We write in the next column the products of the ordinates with cos 18° = 
= 0.95, in the next their products with cos 36° = 0.81, in the next their products 
with cos 54° = 0.59, and finally their products with cos 72° = 0.30. We leave 
the last column blank, after shading its upper extremity in black (it is best 
to use arithmetic for the multiplications). 

3. To find the constant term of the expansion a,/2 = r, we add all the 
ordinates and divide the sum by twenty. 

4. To determine the coefficients ap, by (k = 1, 2, ..., 10), a template of 
transparent waxed paper is prepared for each separate coefficient to patterns 
as shown in Figs. 125. The size of the squares on the template and of the template 
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Fig. 125-c 


itself must accurately correspond with those of the table of Fig. 124. The 
squares need to be bordered with thin and thick lines, or lines of different 
colours. Each template is laid on the table (Fig. 124) and the sum 2;,) calculated 
of the numbers occupying squares with thick borders and similarly the sum 
Ži- of the numbers occupying squares with thin borders. 

After carrying out this operation for all the templates, we find the differences 
between corresponding sums (Zg) — 2(_)) and divide each difference by 10; 
the quotients obtained in fact give the coefficients a,, bi, a, by, - «+, Qro Dio 

5. We find the amplitudes r,,7, ..., 7) Of the various harmonics of the 
required expansion from the formula: 


ty = Va} + bf. 


We determine the phases p., Pz, .--,%, Pio Of the harmonics from the 
formula: 


tan pg = ve 


The angles px can be found to an accuracy of 1° by making use of the table 
given below. 
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The two numbers are sought in the table between which the ratio | a,/b, | 
lies, then the corresponding angle yp is read off, in tens of degrees on the left 
of the relevant horizontal row and in degrees at the top of the relevant column. 


0° 1° 2° 3° 4° 5° 6° 7° 8° 9° 

0°; 0 f 0.01 ' 0.03 f 0.04 | 0.06 | 0.08 | 0.101 0.11 i 0.13 | o5 0.17 
10° | 0.17 | 0-19 | 0.20 | 0.22 | 0.24 | 0.26 | 0.28 | 0.30 | 0.32 | 0.33 | 0.35 
20° | 0.35 | 0.37 | 0.39 | 0.41 | 0.43 | 0.46 | 0.48 | 0.50 | 0.52 | 0.54 | 0.57 
30° | 0.57 | 0.59 | 0.61 | 0.64 | 0.66 | 0.69 | 0.71 | 0.74 | 0.77 |! 0.80 | 0.82 
40° | 0.82 | 0.85 | 0.88 | 0.92 | 0.95 | 0.98 | 1.02 | 1.05 | 1.09 | 1.13 | 1.17 
50° | 1.17 | 1.21 | 1.26 | 1.30 | 1.35 | 1.40 | 1.46 | 1.51 | 1.57 | 1.63 | 1.70 
60° | 1.70 | 1.75 | 1.84 | 1.90 | 2.06 | 2.10 | 2.20 | 2.30 | 2.40 | 2.5 2.7 
70° | 2.7 2.8 3.0 3.2 3.4 3.6 3.9 4.2 4.5 4.9 5.4 
80° | 5.4 6.0 6.7 7.6 9 10 13 16 23 38 115 
90° | 115 — — — — — — = = ae ee 


Having found yg, we get 9, from the following table, according to the signs 
of ak and by. 


ak bk k 

+ T Pk = Yk 

+ PE Pk = 180° — Yy 
z al Pp = 180° + yy 
Ps Fa Pr = 360° — Pk 


All these computations can usefully be set out in the form of a table, as 
shown below for the case of the curve of Fig. 123. 
It may be noted in conclusion that the example worked out gives a reasonably 
accurate result only for the first harmonics, 
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I| f 3j uif e&f ofja Ila hda 
SININ a A] A] 2 3 D a 8 |B) 3 
1 11201 | 424 | -477.7 | 1121 496 | +62.5 | 6037.3) 3906.3; 100 1.24 i 51) 51 
2 | 832 | 819 |+ 1.3 804 785 |+ 1.9 1.7 3.6 2 0.68 | 34] 34 
3 | 653 | 968 | — 31.5 754 865 | —11.1 | 992.3 | 123.2 | 33.4 | 2.84 | 71| 251 
4 | 821 | 888 | — 1.7 785 798 |— 1.3 2.9 1.7 2 1.38 | 54| 234 
5 | 641 | 634 | + 0.5 654 640 |+ 1.4 0.3 2.0 2 0.36 | 20| 20 
6 | 832 | 827 |+ 0.5 797 786 |+ 1.1 0.3 1.2 1 0.45 | 24] 24 
7 | 808 | 813 |— 0.5 802 817 |— 1.5 0.3 2.3 2 0.33 | 18) 198 
8 į 823 | 828 | — 0.5 | 792 | 797 |— 0.5 0.3 0.3| 1 | 1.00 |45| 225 
9 | 816 | 809 | + 0.7 815 802 | + 1.3 0.5 1.7 2 0.54 | 29| 29 
10 |1277 |1294 | — 1.7 | — — — — — 2 œ |90| 270 


§ 15. Supplementary remarks 


on the theory of Fourier series 


150. Expansion in Fourier series. The present section is concerned 
with a more penetrating and rigorous account of the theory of Fourier 
series and starts with a proof of the theorem regarding the Fourier 
expansion of f(x). We shall impose conditions on /(x) different to the 
Dirichlet conditions [143] so as to simplify the proof. The proof of 
the Dirichlet theorem is given later. 

We take the Fourier series for the function f(z): 


where 


+n 
a,==f ti) cos kt dt; b= 


ap + > (a, cos kx + 6, sin ka), 
k=l 


i 
~~ 


+x 
f f (t) sin kt dt, 


(1) 
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the variable of integration being denoted by t so as to avoid con- 
fusion in later working with the variable x of expression (1). We 
substitute the expressions for a; and bx in (1) and find the sum of the 
first (2n + 1) terms of the Fourier series for f(x), the sum being 
written S,(f): 

n 


Sa (= + (a, cos ka + b, sin kx) = 


=1 


= 


+a n 
= = Í rol = (cos kt cos kæ + sin kt sin kz) di = 
+n 


l 1 x 
= Í (t) [= + > cosk(t— a) Jat. 
But we can write [I, 174}: 
sin (n -+) g +sin L 
> : 


2 sin —- 


1 + cosg + cos 2p +... + cos (n — 1)ọ = 


On replacing n by (n + 1) in this equation and subtracting a half 
from both sides, we get: 


sin nt DYP 
l 2 
-z + cosg + cos 29 +... + cos ng = s Tee 
2 sin -7 
whence 
i x sin Loe (t —x) 
z t © cos k(t — x) = ; ; (2) 


k=l 2sin 


—2& 
2 


so that we can write the previous expression for S,(f) in the form: 


P mnte 2) 
Sa (A => |A jo dt 
2sin 5) 


We make a periodic continuation of f(x), assigned in the interval 
(—x, x), so that it can be assumed to have period 27 and to be 
defined for all real x. The fraction appearing under the integral 
sign also has period 2x with respect to t, by virtue of (2). If we take 
into consideration the “remark” in [142], we can replace the interval 
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of integration (— x, x) in the above integral by any interval of length 
27x. We take any value x of the independent variable and use the 
interval of integration (x — a, æ + 2): 


zi (2n +1) (¢— 2) 
D 
=L Doe 
2sin 


2 


It must be repeated that n the future working /(x) is 
understood to be continued outside (—x, x) for all real x in the 
manner described above. 


x x+n 
We split the integral into two: firstly f and secondly J We 
X-a 


introduce new variables of integration z instead of f, given by t= 
= x — 2z as regards the first integral, and by t = x + 2z as regards 
the second. On changing the variables in the integrands and putting 
in the new limits of integration, we get: 


n 


2 
Saf) = | e 22) Ea de + 
0 


te | Ket 2a) Sm On the gz, (3) 


If we suppose that f(x) is equal to unity throughout (—z, x), the 
free term a,/2 of its Fourier expansion is evidently unity whilst the 
remaining terms are zero, i.e. S,(f) is unity for any n, and we have 
the following equation: 


2 _sin (2n + 1)z_ 
= 2 flan TIE de (m= 1,2,3...). (4) 
We prove a lemma before turning to the fundamental theorem: 
Lemma. If the interval (a,b) is all or part of (—a, x) and y(z) is 
continuous, or has a finite number of discontinuities of the first kind, 
in (a, b), the integrals 


b b 
= | ve) cosnzdz and — fyz) sin nz dz 
a a 
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tend to zero on indefinite increase of the integer n. If (a, b) is (— x, x), 
this lemma is precisely the same as the theorem of [147]. We now 
let (a, b) be part of (—z, x). We continue y(z) throughout the longer 
interval (— x, 7z) by making it zero in the parts of this interval outside 
(a, b), ie. we define a new function y,(z) such that ,(z) = y(z) for 
a<z < b and p(z) = 0 for z in (—2, 2) but outside (a, b). We can 
now write, for instance, 


b +x 
1 l 
= | ve) cos nz dz = = fur (z) cos nz dz, 


a -7n 


and this integral tends to zero by the theorem of [147] referred to 
above. It may be pointed out that y,(z) is also continuous, or else 
has a finite number of discontinuities of the first kind in (—z, x). 
The lemma is easily shown to remain valid if (a,b) is any finite 
interval. 

We now turn to the basic theorem for the expansion of f(x) in a 
Fourier series. As usual, we take f(x) as either continuous or possessing 
a finite number of discontinuities of the first kind in the interval 
N LTKT. 

If we multiply both sides of equation (4) by the factor f(x), take 
the factor under the integral sign and subtract the resulting equation 
from (3), we get: 

T 
8, () — Hle) = +f te — 22) — fey] E ae + 


0 


sin Z 


T 
+ [ (e+ 22) — pe SO ae, 
0 


which we can also write in the form: 


F 
8.) — Hae = S A o. E sin (an + 1) zdz + 
0 
T 
++ at at). - sin (2n + 1) zdz. (5) 


0 
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To prove that Fourier series (1) for f(x) is convergent to the sum 
f(z), we have to show that the difference (S,(f) — f(x)] tends to zero 
on indefinite increase of n. 

We consider the function 


_ f(x — 22) — f(x) — 2z 
v2) = — 2z "sinz 


in the interval (0, z/2). It can have discontinuities of the first kind 
originating from the discontinuities of f(x — 2z) and furthermore, 
the value z = 0 must be specially investigated. We suppose that f(z) 
is not only continuous at the point x that we have taken but also 
has a derivative. It follows from the definition of derivative and the 
obvious equation 


that y(z) tends to a definite limit, equal to —2/’(a), as z—> 0. Hence 
the above lemma is applicable to p(z), and the first term on the right- 
hand side of (5) tends to zero on indefinite increase of n. It can be 
shown in a similar manner that the second term tends to zero, whence 
it follows that the difference [S,(f) — f(x)] tends to zero at the point x. 
We thus obtain the following theorem: 

THEOREM. If f(x) is continuous or has a finite number of discontinuities 
of the first kind in the interval (— x, x), tts Fourier series is convergent 
to the sum f(x) at every point x at which f(x) has a derivative. 

It is easy to obtain a more general result. We suppose that at the 
point x the function is continuous or even has a discontinuity of the 
first kind, whilst the finite limits exist: 


fel ond tim LEEM e+ ee) 


h>+0 a 


lim 
h>+0 


The existence of these limits, i.e. of the derivatives to the left and 
right, is equivalent geometrically to the existence of a definite tangent 
to the left and to the right. We have in this case the following supple- 
ment to the theorem: if the finite limits (6) exist, the Fourier series 
for f(x) is convergent at this point to the sum [f(x — 0) + f(x + 0)]/2 
(or to f(x) if f(x) is continuous). 
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We can write on multiplying (4) by [f(~—0)+/(x% + 0)]/2 and 
subtracting from (3): 


Sa (f) — seer Eee ass = 


> 
| Hen Pe) AEO | sin (2m + 1) 2dz + 
0 


2z sin 


EX 
pa f(a + 2z) — f(w +0) . 2 sin (2n + 1) zdz. (7) 
0 


We have to prove that the right-hand side tends to zero on indefinite 
increase of n. 
We can say from the existence of limits (6) that both the fractions 


fe —22)—@—0) ge +22) He +0) 
— 22 22 


have finite limits as z > 0, and we can use exactly the same arguments 
as above to see that both integrals on the right of (7) tend to zero on 
indefinite increase of n. This proves the supplement to the theorem. 
For «=a and «= —mz, by the periodic continuation of f(z), 
limits (6) become: 
Se aid lim ites S 
h->+0 a 


lim 
h> +0 


and the sum of the series will be: 
f(— 2+ 0) + f(x— 0) 
3 ; 


It should be noticed that in all the examples considered in the 
previous section, f(x) satisfies at all points the conditions of the above 
theorem and its supplement. 


151. Second mean value theorem. For the proof of Dirichlet’s theorem and 
a more detailed account of Fourier series we need a proposition of the integral 
calculus that has some similarity to the mean value theorem given in Vol. 
I [I, 95}. The new proposition is usually referred to as the second mean value 
theorem, and may be stated as follows: if p(x) is monotonic and bounded and has 
a finite number of discontinuities in the finite interval a < x < b, whilst f(x)’ 
48 continuous, we have: 


b £ b 
f (x) f(x) dx = g(a +0) § f(x) dx + pb — 0) J f(x) da, (8) 


where è te a number belonging to the interval (a, b). 
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It is sufficient to prove (8) for an increasing (non-decreasing) p(x), since 
evidently if p(x) is decreasing, [—(x)] is increasing, and on applying (8) for 
[—(x)] and changing the sign on both sides, we get (8) for g(x) itself. It can 
be shown moreover that it is sufficient to prove (8) for the case of g(a + 0) = 
= 0. We assume that (8) is proved for this case and that p(x) does not satisfy 
the condition stated. We introduce a new monotonic function p(x) = p(x) — 
— g(a + 0). The end values of this function will be y(a + 0) = 0 and y(b — 0)= 
= g(b — 0) — g(a + 0). Equation (8) is applicable to y(x) by hypothesis, and 
gives, since y(a + 0) = 0: 


b b 
{f y(x) f(x) dx = y(b — 0) j f(x) dz, 
or 
b b 
$ [y(x) ~ pla +0)] f(x) dz = [ẹp(b — 0) — pla + 0)] J f(x) dz, 
whence 


b b b b 
$ plx) Kx) dz = g(a +-0)[ f f(x) da — J f(x) dr] + pb — 4 f(x) de, 
a a 


which leads immediately to equation (8) for g(x). All in all, therefore, it is suffi- 
cient to prove (8) for increasing, or more precisely, non-decreasing (x) for which 
g(a + 0) = 0. This type of function clearly has non-negative values in (a, b). 

We carry out the proof by subdividing (a, b) with the aid of the points: 


Ly = Q, Wy, Lygre ess Bia Vj-+ +, Cpt, Tp = b. 


We know from [I, 95] that: 


xi 
$ Hx) dx = f(&) (£i — £i), 


Xi—1 


where ¢; belongs inside (x;_,, %;). 
We form the sum: 


n n A 
Z PED NED iei) =X 6) | fe) az. 
z ra Xia 


On indefinite increase of n and indefinite decrease of the greatest of the lengths 
of sub-intervals (x;_,, x;), this sum tends to the definite integral (as we know 
from Vol. I), i.e. we have: 


b 
Í (x) f(x) da = lim 5 plé) f f(z) a 


i=l Xi 
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We now investigate the sum 


5 p($;) F f(z) dz = PEOI y ja) da~ § He ) dz] = 


i=1 Xizi =l Xi—ı 
a 
rën § He) d+ Slee = Em] f Kæ)a (9) 
Xi—ı 
The integrals 
b b b b b 
§ Ha) da, f f(x) da, § f(x) da,..., | f(w)da,..., | f(x) de (10) 
a Xı Xs Xtina Xn—1 


represent particular values of the function 
b x 
f Hx) da = — f f(a) de, (11) 
x b 


which is a continuous function of the limit of integration 2 [I, 96], so that all 
the values (10) lie between the least and greatest values m and M of function 
(11). 

If we take into account the fact that in (9) all the factors 


(61) and pfi) — P(r) 
are non-negative, and replace integrals (10) on the right of (9) firstly by m, 
then by M, we get: 


S (81) mo f(x) da > {9(&) + PAG) — 9(&;-1)]} m = (En) M, 


i=l Xi—ı 


2 ple) § fade <{o(&) + S'LeE)—p(eea)}} M = o1e,) M, 
ay i=2 
ie. 


(En)m < 2 pl) § fle) de < plën) M 


=1 Xia 


whilst in the limit, as n — co and the greatest of the lengths of the (x,_,, 7;) 
diminishes indefinitely, since we have 


En > 6—0 and p(n) > 9(b—0), 


the inequality becomes: 
b 
pb —0)m < J p(x) f(x) de < pb —0) M, 
a 
i.e. 


b 
$ p(z) f(x) da = p(b — 0) P, 
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where P is a number lying in the interval (m, M). But the continuous function 
(11) takes all values lying between its least value m and greatest value M in the 
interval (a, b) [I, 43], including the number P, so that it must in fact be possible 
to find a é in (a, b) such that 


b 
Sf (x) dx =P, 
£ 
which gives us 


b b 
f p(z) f(z) de TAR a) da, 


and this is the same as (8) since g(a + 0) = 0 by hypothesis. It may be mentioned 
that (8) can be proved without assuming continuity of f(x) and a finite number 
of discontinuities of g(x), though we shall not dwell on this. Finally, it must 
be pointed out that the more general formula than (8) may be proved: 


q 3 b 
f p(z) f(z) dz =A f f (x) dx + B f f (x) dz. 
a a é 


where the numbers A and B must satisfy the conditions A < g(a + 0) and 
B > ọ(b — 0). ; 

CoroLLaRy. We saw in [147] that, given certain conditions, the Fourier 
coefficients a, and 6, of f(x) tend to zero as n — oo, If f(x) satisfies Dirichlet 
conditions, it can be shown more precisely that for large n the coefficients are 
infinitesimals of order not lower than 1/n, i.e. we can write: 


M 'M 
lanl < 7: [bal < 7’ 


where M is a definite positive number. By hypothesis, the interval (— x, 7) 
can be divided into a finite number of parts in each of which f(x) is monotonic 
and bounded. Let (a, $) be such a sub-interval. The coefficient a, is the sum 
of a finite number of terms of the form: 


8 
= f (x) cos ng dx, 
a 


which can be transformed by the mean value theorem: 


8 B 
= fre) cos ng dx = Lja +0) [ cos na de +46 —0) | cos nz de = 
a $ 


a 


oe f (a + 0) (sin n — sin na) + f (B — 0) (sin ng — sin né) 


an 


This gives us for each separate term in the expression for a, an upper limit 
of the form M/n, where M = 2 | f(a + 0)|/ + 2 |f(8 — 0)|/x. There will clearly 
be a similar upper limit for the sum of a finite number of terms of this form, 
i.e. for a,. Analogous reasoning applies in the case of bp. 
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If f(x) is continuous and f(— z) = f(z), whilst the derivative f'(x) exists 
and satisfies Dirichlet conditions, we obtain, on integrating by parts and 
noticing that the term outside the integral vanishes since f(— 7) = f(z): 


+2 l +a +a 
nb, = Z J J(x) sin ng dz = =< f }(z) d cos nz = = f f’ (x) cos nx dz. 
-n —2 -n 


But the last integral, being the Fourier coefficient of f’(x) satisfying Dirichlet 
conditions, must have the same upper limit as above, so that with the assumptions 
made we get for bp: 


M 
[bal <P © 


We can write an analogous expression for a, A more detailed discussion follows 
later of the values of the Fourier coefficients in relation to the properties 


of f(x). 


152. Dirichlet integrals. It is clear from expression (3) that the question of 
the convergence of a Fourier series, i.e. of the existence of a limit of the sum 
S,(f), amounts to an investigation of integrals of the form: 


b 


sin mz 
[oe SE ae 
a 


We shall consider the simpler type of integral: 


b 
2 f ee) amme dz, (12) 


z 


a 


which is known as a Dirichlet integral. We prove the following lemma in regard 
to this: 

LEMMA. If p(z) satisfies Dirichlet conditions in the interval (a, b), we can say : 
(1) if a = 0 and b > 0, integral (12) tends on indefinite increase of m to the limit 
~(+0)/2; (2) if a =0 and b <0, the limit becomes p(—0)/2; (3) if a <0 
and b > 0, the limit ie {p(—0)+p(+0)}/2; (4) if a and b > Ooraandb < 0, 
the limit is zero. It can easily be seen that it is sufficient to prove only the first 
statement, from which the remainder follow without difficulty. We prove 
statements 3 and 4, for instance, on the assumption that the first has been 
proved: 


a b a 


1 sin mz 1 i sin mz i sin mz 
x | oe) a=- Jro EE a — = | oe) A ae. 
b 0 


If a and 6 > 0, both terms on the right-hand side here have the limit (+ 0)/2 
and their difference consequently tends to zero, which proves statement 4. 
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If a < Oand b > 0, we replace the variable of integration z by (—z) in the second 
term on the right, and obtain: 


z 


b ~a 
l sin mz l sin mz 1 sin mz 
— | oe) —— dz = — | re) AT. dz + = fe- z) oe dz. 
0 0 
Since b and (—a) > 0, we can apply statement 1 to both integrals and obtain: 


Lje sin mz sinm 4. + +-o+ 04+ p(—0) = HO) EEO) 1 


We now turn to the proof of statement 1, i.e. we prove that with b > 0, 
b 
sin mz l 
L fya ETE de> 4 pH 0. (13) 


We shall assume for the present in our proof that p(z) not only satisfies Dirichlet 
conditions but is also monotonic in (0, b). 
We had earlier the result: 


` sing n 
|A ue (14) 
0 
We consider the integral: 
e 
i sin x d 
x 


This is a continuous function of c, vanishing for c = 0 and tending to 2/2 
as c — + œ. We can infer from this that for all positive c the integral written 
remains less in absolute value than a definite positive number M. We now take 
the integral with two positive limits 


b 
sin x 
a 
We clearly have 
b a 
f sin x jz sin x w [2 sin x 
dz = 
x 
a 0 0 


and 


b b 
[2 de <| [ E as ja f de| < M+M =2M, 
a (X 
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ie. the absolute value of integral (15) remains less than a definite. positive 
number 2M for any positive a and b. 
Before proving (13), we consider the simpler integral: 


b . 
1 f sin mz 
n x 

0 


da. 


If we change the variable in accordance with t = ma and use (14), we get 
on. indefinite increase of m: 


1 ” ai 1 e l l 
( sin mg sin t a 
| a dea [Mast paz, 
0 0 
and consequently: 
1 2 
sin mx 
—| (+0) —_— de> >- (+ 0) 
0 


Hence it remains for us to show, in order to prove (13), that 


b 
= fie = oo 2% de +0, 


0 


i.e. that the absolute value of the left-hand side is less than any positive e for 
sufficiently large m. We split the interval of integration into two parts: (0, ô) 
and (ô, b), where ô is a small positive number which we fix later. We show that 
each of the two integrals: 


b 
sin mx 


sin mæ do ad =. Jo ) — »(+0)] ———— dz (16) 


ð 
Z fiee) 01 
0 


has an absolute value less than ¢/2 for sufficiently large m. Since g(x) has a 
finite number of discontinuities, we can take 6 sufficiently small for the interval 
(0, ô) to contain no discontinuity, so that p(x + 0) = p(x). On taking into account 
the fact that p(x) is monotonic, and applying the mean value theorem to the 
first of integrals (16), we get: 


sin mx sin mx 


+ fie — (+ 0)] ——— dr = tp (6) — p+ on | dr, 


and consequently: 


sin mx 


| ftre- 9+ 0)] ——— mE ae | < | 0) — e+ 0) -2M. 
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We have (6) — p(+0)— 0 as ô— 0 by definition of the symbol 9(-+0), 
and we can therefore take 6 close enough to zero for the right-hand side of 
the equation written above to be less than ¢/2. The absolute value of the first 
of integrals (16) will now be less than ¢/2 for any m. Having thus fixed the 
positive 6, we go back to the second of integrals (16). We write this in a new 
form by again applying the mean value theorem: 


sin mg sin mx 


ee Ars de + ero ~ e409 | dx. (17) 


The factors in front of the integrals are constants and it is sufficient for us 
to show that the integrals tend to zero on increase of m. We take say the first 
integral and change the variable in accordance with t = mx. We get: 


me 
| = dt. (18) 


mé 


The limits mô and mé increase indefinitely on indefinite increase of m, since 
ô is a fixed positive number and € is not less than ô. But since 


œ 

f sint dt 
7 t 
0 


is convergent, integral (18) must tend to zero on indefinite increase of both its 
limits [82]. Similar arguments apply for the second integral in (17), so that the 
expression as a whole tends to zero, i.e. the second of integrals (16) tends to zero 
and its absolute value is less than €/2 for sufficiently large m. 

We have proved equation (13), and consequently all the statements of the 
lemma, on the assumption that p(z) is monotonic as well as that it satisfies 
Dirichlet conditions. It remains to show that (13) is still true when 9(z) satisfies 
Dirichlet conditions only. These conditions imply that (0, 6) can be divided 
into a finite number of sub-intervals in each of which 9(z) is monotonic. Let 
us suppose that (0, 6) can be divided into three such sub-intervals (0, b,), (b,, 
b), (bz, b). Integral (13) is now split into three: 


b 
foo snme dz = 
0 
by . bs 
= | 9) Sam az Py eel sin mz de e+ fo sin mz ae (19) 
0 d, 


The lemma is applicable to each term on the right, since (z) is monotonic in 
(0, b,), (bi, b2), and (b,, b). Hence the first term tends to p(+0):2, whilst the 
remaining two tend to zero. It follows that integral (19) tends to ø(+0)/2, 
which it was required to prove. 
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It may be pointed out that the number m in Dirichlet integral (12) can increase 
indefinitely in any manner and is not obliged to take only integral values. 
The result obtained has its origin in the fact that the function (1/z) sin mz changes 
sign very often for large m and moreover, takes large values for z near zero. 


153. Dirichlet’s theorem. Dirichlet’s theorem [143] can be proved without 
difficulty by using the lemma of the previous section. We have to show, by (3), 
that the expression 


a n 


2 
1 in(2n +1 1 in(2n +1) 2 
= fie 22) =e Vere: fre +22) PE? ae (20) 
0 0 


tends to {f(z — 0) + f(x + 0)}/2 on indefinite increase of n. Instead of (20), 
we take the expression: 


oh 


z z 


T . 
0 


The upper limits are positive in both integrals, and the functions f(x — 2z) 
and f(x + 2z) satisfy Dirichlet conditions in the interval of integration. Further- 
more, m = 2n + l — œ, and by the lemma proved above, expression (21) 
tends to the limit {f(x — 0) + f(z + 0)}/2. Itremains to show that the dif- 
ference between expressions (20) and (21) tends to zero. For this, it is sufficient 
to show that the integrals 


L fje- m (hen an + nee 


n 


fie +22) (z —-}) sin (2n + 1) zde 


tend to zero. We shall prove this for the first integral: 


n 


3 3 
tfre- anz -= sin (2n + 1) z dz = — x | ve) z)sin (2n + 1) z dz, 


(22) 


= 


where 


ve) = fle — 2) (ir) 
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The first factor f(x — 2z) has a finite number of discontinuities of the first kind 
(or is continuous) in the interval of integration. The second factor 


z z3 z5 
1 l1 z=snz ” (4: - a t a) 
sinz z zsinz 2 23 25 
“(Fj + or + J 


tends to zero as z — 0 and has no discontinuities whatever in the interval 
(0, 2/2). Consequently the lemma of [150] is applicable to integral (22), and 
the integral tends to zero. The assertion of Dirichlet’s theorem is thus proved. 

We supplement the theorem with two further propositions which are stated 
without proof. The result obtained above reveals only that the Fourier series 
S[f(x)] is convergent to the sum f(x) at every point w of the interval; it says 
nothing about the nature of the convergence in the interval (— 7, x). This 
gap is filled by the propositions now stated: 

1l. In every interval lying inside (— n, n) in which f(x) not only satisfies Dirichlet 
conditions but is also continuous, the series S[f(x)] is uniformly convergent. 

2. If f(x) satisfies Dirichlet conditions and is continuous throughout (— n, %), 
whilst moreover 


K(— +0) = f(x — 0), 


the series S[f(x)] is uniformly convergent for all x. 

Dirichlet’s theorem places relatively few restrictions on the function f(x) 
to be expanded. Nevertheless, the theorem for expansion in Fourier series is 
not valid for any f(x) and there even exist continuous functions which cannot 
be expanded in this way. 

The reader will easily show that propositions similar to the above are valid 
for series expanded in cosines only or sines only for the case when the function 
is defined in (0, x), with the following changes: 

With the conditions of Dirichlet’s theorem for (0, 2), the sum of the series 


n 
o + J a,coska; aç = nA EO cos kt dt (23) 
2 k=1 a, 
0 
is equal to 
Se Cereus (24) 
and to 


(+0) for x=0; f(x — 0) for x=% 


whilst the sum of the series 
a na 
S b,sinke; by = 2 f f(t) sin kt dt (25) 
k=l 
0 


is (24) for 0 < x < 2 and zero for x = 0 and x = x. 

All these results are obtained very simply if f(x) is continued in the neigh- 
bouring interval (— x, 0) as was done in [145], the continuation being even in 
the case of series (23) and odd in the case of series (25). 
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154. Polynomial approximations to continuous functions. Our next 
task is to prove the closure equation (40) of [147]. The proof will be 
based on certain results in the theory of polynomial approximations. 
These results are important in themselves and their proof is based on 
the following theorem: 

THEOREM I (Weierstrass’s theorem). If f(x) is continuous in the closed 


interval a < x < b, a sequence of polynomials P, (x), P, (x), ... can be 
formed which tends uniformly [I, 144] to f(x) throughout the closed inter- 
val (a,b). 


We notice first of all that the interval (a, b) can be reduced to (0, 1) 
with the aid of the transformation x’ =(x—a)/(b—a), and polynomials 
in x become polynomials in x’ and conversely. We can therefore assume 
that the interval (a, b) is (0,1). We start by proving two elementary 
algebraic identities. We write down the binomial formula: 


n 
> Ch Um on™ = (u + vo)", (26) 
m=0 
On differentiating this with respect to u and multiplying by u, then 
carrying out the same process on the identity thus obtained, we arrive 
at the two new identities: 


Smog um yr—m — nu(u + v)" t, | 
m=0 
a (27) 
D m Or um on™ = nu(nu + v) (u + v)". | 
m=0 


On setting u = x and v = 1 — 7v in (26), we get: 
n 
l= YOg (1 — a, (28) 
m=0 


We now multiply (26) by n? q?, the first of (27) by (—2nx) and the 
second of (27) by unity, and add; this gives us, with u = x and v = 
=l] —~ z: 


> (m—nz? Cm am (1—ax)"—™ = na(1— x). 


m=0 
It is easily shown [I, 60] that the right-hand side of this equation is 
positive in (0, 1) and takes its greatest value for x = 1/2, whence it 
follows that 
n 
> (1 — nex)? 07 2 (1—2) < $n. (29) 


m=0 
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We now show that the polynomials 
4 m 
P, (2) = Sf (Z) Rer a — arn (30) 
m=0 


are uniformly convergent to f(x) in (0, 1). On multiplying both sides 


of (28) by f(x) and subtracting (30) from the equation obtained, we 
can write: 


fla) — Pa = S[Ae) i i)a țar. 


We have to show that, given a positive e, there exists an N, not depend- 
ing on 2, such that: 


n 
|= [Ka —f $5] On a (1 — zy <e for n>N. 
m=0 
Since the products C72" (1—2)""" > 0 for 0<2< 1, we have 


|S [1 —-1(F) Cran (1 — znr < 


n 
ù 


<3 


m=0 


Cram (l—zxyr-™, 


a-i) 


and it is sufficient to prove the inequality: 
n 
>lo- (=) 
m=0 


The function f(x) is uniformly convergent in (0, 1) [I, 35], ie. there 
exists a 6 such that | f(2,) — f(x) | < e/2 for | a, — x| < 6. Let us fix x 
from (0, 1). We split sum (31) in to two parts S, and S,, and carry into 
the first sum the terms for which m satisfies the condition |x — mjn|<6. 


The first sum consists of positive terms and we can write, in view 
of the choice of ô: 


OR (l— a mce for n>N. (31) 


S, < Se OR am (1 — x)", 
() 
where the (I) indicates that summation is over the m satisfying 


| x — mjn | < 6. If the summation isover all m from 0 to n, the sum 
can only increase, i.e. 


n 
Sı < S$ Orem — a-n = E Soma" (1— a)" 


m=O m= 


446 FOURIER SERIES [154 


so that by (28), S, < ¢/2 for any n. We turn to the second sum 


s= 5a) 


up! 
where summation is over the m which satisfy | x — mjn | > dor 
| ng — m | > nô, and evaluate it as follows. The function f(x) is 
continuous in the closed interval (0, 1) and must therefore satisfy in 
the interval an inequality of the form | fa) | < M, where M is a 
definite positive number [I, 35]; hence | f(x) — f(m/n) | < | f(x) |+ 
+ | f(mjn) | < 2M. In addition, we multiply the terms of S, by the 
factors (nz — m)?/n?6*, which are not less than unity. On taking 
outside the factors 2M and 1/n?ô?, which are independent of the 
variable of summation m, we get: 


CM x™ (1 —a)r-™, 


8, < SE > (m— ney? Om a™ (1 — arm. 
an 
All the terms are positive and the sum can only increase if the sum- 
mation is over all m from m = 0 to m = n. On taking into account 
(29), we obtain: 


M 
2nò? ` 


n 
8, < SAE > (m— na)? On am (1 — x)" < 
m=0 


M and ô are both definite positive numbers and it is enough to take 
M|2nô? < e/2, i.e. n > M|eô?, in order that S, may satisfy S,< €/2. 
We have now obtained the N = M/e6? which we required to find. 
For n > N, both S, and 8, < «/2 and inequality (31) is satisfied; 
Weierstrass’s theorem is thus proved. As can easily be seen, the 
theorem may be stated as follows: if f(x) is continuous in the closed 
interval (a, b) and e is any given positive number, there exists a poly- 
nomial P(x) in x such that the inequality is satisfied throughout (a, b): 


lx) — P(a)| < e. (32) 


On the basis of Weierstrass’s theorem, a similar theorem may be proved 
for periodic functions. 

TuroremM II. If f(x) is a continuous periodic function of period 2x 
and « is any given positive number, it is possible to find a trigonometric 
polynomial 


T(x) = c + 5 (cp cos kx + d, sin kx), (33) 
k=l 
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such that for any x: 
f(x) — T (£) < e. (34) 


We notice first of all that, due to the periodicity, it is sufficient to 
prove inequality (34) in the basic interval (— x, 2). We start by sup- 
posing that f(x) is an even function, and replace x by the new variable 
t = cos g, ie. x = arc cost, the principal values of this function being 
taken, so that as ¢ varies from 1 to (—1), 2 = arc cost varies con- 
tinuously from 0 to x. The function f(x) = f(arc cos t) will be continuous 
with respect to ¢ in (—1, 1) and in accordance with Weierstrass’s 
theorem a polynomial P(t) will exist such that 


|f(are cost) — P(l)|<e (—1<t<}), 


or on returning to the original variable, 
f(z) — P(cosz)i<e (O<a< az). 


Since f(x) is even its value is unchanged on replacing x by (—2) 
and similarly for P(cos x), since cos x is even; hence the inequality 
written is also valid for —x < x < 0, ie. it is valid throughout the 
fundamental interval. We know from [I], 176] that positive integral 
powers of sin x and cos v can be expressed linearly in terms of sines 
and cosines of multiples of the angle, i.e. P(cos x) can be written in the 
form (33); thus the theorem is proved. 

We now take any continuous periodic function f(x). If we put 


He) => (he) +l—a)]; vie) = +(e) —f(—2)], (35) 


f(x) is equal to the sum of g(x) and y(x), where p(x) and y(x) are respec- 
tively even and odd periodic functions. By what has been proved, 
there exists for a given ¢ a polynomial P(t) such that | g(x) — P(cos <x) | 
< ¢/2. If we can show that a polynomial Q(t) exists such that 


lp(a) — sin x Q(cos x)| < > (—ax<2x<2), (36 ) 
the trigonometric polynomial 


T(x) = P(cosx) + sin x Q(cos x) 


will satisfy condition (34). We introduce as before a new variable 
t = cos x and consider y(x) = (arc cos ¢) in the interval —1 <¢ < 1. 
In the manner of all continuous, odd periodic functions, p(x) vanishes 
for x = 0 and x = x, and consequently y(arc cos t) vanishes at the 
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ends of the interval, i.e. at t = +1. It follows from (30) that if f(x) 
vanishes at the ends of the interval (0, 1), i.e. (0) = f(1) = 0, the 
polynomial P,, (x) has the same property. We can transform the inter- 
val (0, 1) to (—1, 1) by using the transformation t = 2% — 1, which 
enables us to say that a polynomial R(t) exists, equal to zero at 
t = +1, and such that 


lp(arc cost) — R| <$ for —1<t<1. 


We can now write R(t) = (1 — #)R, (t), where R, (t) is also a poly- 
nomial, and the above inequality can be written in the new form: 


lp(x) — sin? xR; (cos x)| < > for O<a<a. (37) 


As regards the function sin æ R, (cos x) = VI — # R, (t), continuous 
in (—1, 1), we can find a polynomial Q(t) such that 


1 —# 2, O—Q| <> for —1<t<1, 
Le. 
jsin zR, (cos x) — Q(cos x)| < — for O0<a< aq, 


and all the more, 


|sin? xR, (x) — sin x Q(x)| < 2- ; (37,) 


since | sin «| < 1. It follows from (37) and (37,) that 


ly(x) — sin x Q(cos x)| < |p(x) — sin? x R,(cos x)| + 


-+ |sin? x R (cos x) — sin z Q(cos x)| < > + + = 5 , 
i.e. we have proved inequality (36) for the interval (0, z). But y(x) 
and sin x Q(cos x) are both odd, so that the inequality likewise applies 
throughout (—z, 2). 
The above proofs of Theorems I and II are due to Prof. S. N. Bern- 
shtein. 


155. The closure equation. The closure equation of [147] for systems 
of trigonometric functions follows fairly easily from the theorem just 
proved. We start by supposing that f(x) is a continuous function 
given in the interval (—z, x) and that /(—z) = f(x). 
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We obtain a continuous periodic function by periodic continuation 
of f(x) outside the interval. There will exist, for a given e, a trigono- 
metric polynomial T(x) which satisfies inequality (34). 

It follows from this inequality that 


+r 
sz | Ma) — Tha) de < e. (38) 


Let n be the order of the trigonometric polynomial, i.e. the value of m 
in expression (33). Now on the other hand, for any choice of trigono- 
metric polynomial of order not higher than n, integral (38) has a least 
value ež, obtained when we take as the polynomial the sum of the 
first (2n + 1) terms of the Fourier series for f(x). It follows from this 
that £n < £, and since we can take as small a positive € as we wish, 
we can say that en, which does not increase with increasing n, must 
tend to zero as n—> œ; and as we know from [147], this is equivalente 
to the closure equation for f(z). 

We now take the more general case when f(x) is continuous in (— x, 
x) but f(—2) and f(z) are distinct. As usual, a positive number M 
exists such that | Aa) | < M for —x < x < x. Leta positive number n 
be arbitrarily assigned and let the positive 6 satisfy the inequalities: 


mH 


ô< y? Ô <T (39) 
We construct a new function /, (x) in accordance with the following 
rule. We take f, (x) equal to f(x) in (—a, x — ô), whilst we take the 
graph of f; (x) in (x — 6, x) as the straight line joining the points 
x = x — ô, y= f(n — ô) and x = m, y= f(—x) (Fig. 126). Now 
fı (x) is continuous in (—x, z) and has the same value f(— x) for x = 
= +2, whilst evidently, as for f(x), | f, (x) | < M. 

By what we have proved above, for any given positive 7 a trigono- 
metric polynomial can be found such that 


+7 
se] he Tede < 4. (40) 


-n 


Since f(x) = f, (x) in (—x, x — ô), we have: 


+r x. 
sz |) — Ae) de = | (Ae) — Ala)? ae. 


n 
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Hence we can write, since | f(z) — fı (x) | < | f(z) | + lf (x) | < 2M: 


+n P 
=z [MA wPde <2 f a BES, 


or, by (39): ia 
1 w 
sz | eh (ade <p. (41) 
We now consider: D 
= i: [fx) — P(x)}? dx = xf. {L4lae) — fy (2)] + [fs (2) — T(2)]} de. 


-7 


Fie. 126 


On taking into account the obvious inequality (a + b} < 2(a? + b), 
we can write: 


a (Ma) — T(&2)]? de < 


E +n 4a 
<E f ehara | ie — Tade, 


and it follows from this, by (40) and (41), that 


+7 


ar | æ) — Tla) de <n. 


-7 


If we let n denote the order of the trigonometric polynomial T(x) and 
argue as above, we obtain from this e < n, and in view of the arbi- 
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trary smallness of 7 we have ¢,—>0 as n—> ©, ie. the closure equa- 
tion is also valid for f(x) with the properties stated above. It can 
be shown in exactly the same way that this equation likewise holds 
for f(x) bounded in (— x, x) and having a finite number of discontinu- 
ities. If all the discontinuities are of the first kind, there is no need 
to stipulate boundedness of the function. We can carry out the proof 
by isolating the points of discontinuity with narrow intervals then 
forming a new function f, (x), continuous in (—2, x) and coinciding 
with f(x) outside these intervals, whilst having a linear graph inside 
the intervals. By the above, a trigonometric polynomial 7x) which 
satisfies inequality (40) can be formed for f, (x), whilst the isolating 
intervals can be chosen narrow enough for inequality (41) to be satis- 
fied. The rest of the proof is as above. We have thus proved the 
closure equation for all functions having a finite number of dis- 
continuities of the first kind (or which are continuous). It may be 
mentioned that the equation holds for a much wider class of functions. 


156. Properties of closed systems of functions. We now consider some con- 
sequences of the closure equation, and instead of confining ourselves to 
systems of trigonometric functions, refer the discussion to any system of orthog- 
onal and normalized functions 


Pi (x£), Pz (2), Ps (x), seer Yn (x), eet (42) 


in the interval (a, b). We suppose that, with respect to this system, the closure 
equation is valid for any function with a finite number of discontinuities of 
the first kind. We shall only refer to functions of this type in future. We 
bring in the generalized Fourier coefficients of the function f(x): 


6 
Cy = J f(x) py (w) dx. 
a 
The closure equation has the form: 
b œ 
f aP dar = X ck. (43) 
a k=l 


We now turn our attention to some important consequences of this formula. 
1. If f(x) and p(x) are any two functions, with Fourier coefficients cy and dp: 


b b 
c= Í f(x) pk (€) de; dk = f p(x) vy (x) dz, (44) 
we have = 
b o 
$ Kx) p(x) de = 2, ck dp (45) 


the series on the right being absolutely convergent. 
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In fact, on replacing f(x) in (43) by f(z) + p(x), where & is an arbitrary 
constant parameter, we obtain: 


b o b 
$ Cx) + p(z) da = 2, LS Hæ) + plz) yy (x) de}, 
or, by (44): 


b b b 
S (fw) Pda + 2 f f(x) g(a) da + & f [9(x)P dr = 


= X ch+ 2 X edt X dh. 
k=l k=1 


k=1 


Comparison of the coefficients of like powers of £ gives us (45). 
The absolute convergence of the series appearing in (45) is obvious from the 
fact that 


1 
lex dil < -7 (ck + dk). 


since we know that the serice 2 (ce + dy) is convergent. 


2. If a function p(x), dependent on certain parameters, satisfies for all values 
of its parameters 


b 
f [e(x)# de < M, 
a 
where M is an independent constant, the series 
È kdg (46) 
k=1 


is uniformly convergent with respect to the parameters. 
The proof is based on a simple, yet at the same time extremely important 
inequality: whatever the real constants 


a; de,. Le ] an; Bio Bz» ...3 Ba 
we always have 
m m m 
( È arb) < È ake X Bk (47) 
k=l k=1 k=l 
the sign of equality being obtained only when all the a; and B, are in the same ratio: 
a E 
a Gener aoe 


In fact, let £ be any real number. We form the sum 


Sm = PA (tak — B,)*, (48) 


156] PROPERTIES OF CLOSED SYSTEMS OF FUNCTIONS 453 


which is clearly non-negative. The sum can only be zero in the case when 


Sa, — Pk = 0, k=], Deke, m, 
i.e. 
eee ee $m _¢ 
a, az a am , 


and evidently in this case: 
(S ay By)? = Xk: > bk. 


Generally speaking, on removing the brackets in (48), we can write an equa- 


tion of the form 
Sm = AG’ — 2B +0, 


where 


Since Sm must always remain positive, we know from elementary algebra that 
— AC < 0, i.e. we must have B? < AC, which gives inequality (47). 
We return to our proposition 2. We form the sum: 


n+p 
PJ ck A; 
k=n+1 
we have in accordance with (47): 
"sP HEE FA “ntp 
koe <| > Cke > dj. 
kent] k=n+1 


On the other hand, if we replace f(x) and c; in (43) by p(x) and d, respectively 
we are enabled to write: 


n+p 


di< X dh =f to(e)Pae < M. 
k=n+1 k=1 a 


‘The terms of the series PA cè are independent of the parameters by hypothesis, 
=1 

so that for any previously assigned small positive e we can find an N, independ- 

ent of the parameters, such that for all n > N and all p> 0 we have the 

inequality: 


"5P d= & 
k=n+1 A M 
With this, we get: 
n+p 
| & ¢d,|<e for n>N, 
k=n+l 


from which follows the uniform convergence of series (46). 
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3. If x, and x, are any two values from the interval (a, b), we have: 


$ fe) de = X og ye(e) ae, (49) 


the series on the right being uniformly convergent for all x,,x, of (a,b). 
If we knew that f(z) had the Fourier expansion 


f(a) = PA ck Plz) (50) 


and that this series was uniformly convergent, (49) would be obvious [I, 146]. 
The remarkable fact is that the formula is always valid, even if series (50) 
is not convergent, i.e. (50) can be integrated term by term just as though it 
were uniformly convergent and had the sum f(x). 
To prove (49), we put in expression (45): 


lifa<2r< 2, 
z)=;\ 
Oifa<a<a ora,<a2<b. 
The quantities x, and x, appear here as the parameters on which g(x) depends. 
The number M mentioned in proposition 2 exists, since 


b b 
f (ea) Pda < f de =b —a. 


We have furthermore, since p(x) is zero outside the interval (x, 2»): 


b Xs 
dy = Í p(x) y (x) da = f p(x) dx 
a Xı 


and, by (45): 


b Xs œ œ Xs 
f fa) (a) da = J fx) da = X eq de= 3 cx Sv (2) de, 

a x k=1 k=l x, 
which it was required to prove. 

Remark 1. On applying this to ordinary Fourier series, it can be shown 
that they can be integrated term by term as though they were uniformly con- 
vergent and had the sum f(x) not only for intervals inside (— x, 2) but for any 
interval in general. Here, the function f(x) must be continued periodically 
outside (— 2, x) as indicated in [143]. 

Remark 2. Inequality (47) is applicable to integrals as well as sums, in which 
case it has the form (Buniakowski’s inequality): 


b b b 
l $ h) ja (dr < f Ae) de: f fæ) de. (51) 
We see this by considering 


b b b b 
S Ih (2) + êf (x) Pda = f fi (æ) dæ +26 f fi (x) fa (=) dx + & f fF (2) de, 
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where é is any real number. It follows at once from the form of the left-hand 
side that this cannot be negative for real &. But if A + 2B + C& is non- 
negative for real é, we must have B? — AC < 0, i.e. B? < AC. Application of 
this to the above gives us inequality (51). 


157. The character of the convergence of Fourier series. The series obtained 
by us in [144] suffer from the defect that they converge badly. Some of them 
are not absolutely and uniformly convergent, for instance, series (10) [144] 
becomes for x = 2/2: 


l 1 1 
EAN 

which is not absolutely convergent; in addition, (10) cannot be uniformly con- 
vergent since it represents a discontinuous function [I, 146]. The series represent- 
ing the discontinuous function having the values c, and c, has the same defect. 
There is a relationship between the nature of the expanded function, its con- 
tinuity, and its Fourier series. We shall consider the relationship in detail here. 
We assume once and for all that the f(x) and its successive derivatives, to be 
discussed later, are functions satisfying Dirichlet conditions and periodically 
continued outside (— x, 2). Let 


x) , 2), ..., sO i 
denote the discontinuities of f(x) inside (— 2, z), and let 
Bp pyres ey t, -1 
denote the discontinuities of its derivative f’(x) inside (— 2, 7), whilst in general 
2, 28, o.o, a, 


are the discontinuities of the derivative f(x). It will be necessary to add to 
the discontinuities the ends of the interval (— 2, 2), if the limits: 


Fato), f(Fx+0),..., O (Fao) 


aro not the same. 

For the sake of symmetry, we write a) = — z and 2) = x, and similarly 
for the derivatives. The above condition for the derivatives amounts to the 
existence of a continuous f(x) inside any interval (2, a) (s =0,1,..., 
Tk-1)- By the Dirichlet conditions, the derivatives will tend to definite limits 
at the ends of the interval. 

We now transform the expression for the Fourier coefficients of f(z). We 
start with the coefficient 


+a 


a, = = f f(x) cos ng dg. 


-n 
We sub-divide the interval of integration (— x, x) into 


(—7, z), aW, £), (2 ., a), 
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so that f(x) is continuous in each sub-interval. Integration by parts gives us: 


frw cos ng da = sinne ——— _ f(x) — -+ fr (x) sin nz dg, 
Since, on the other hand: 
xf) x60) er 
f {(x) cos nz dx = lim 5 }(a) cos nz dz = 
x) P, >O x(O, +6 
(0) 
sin ng x=x 9 = a, 
= lim 227 ie) aS J f (2) sin ne dx, 
ef, 610 n x=x() +e 
x, 


we obtain in view of the continuity of sin nz: 


x 
f(x) cos næ da = = Ha — 0) — SREY Hao +0)— 
x (0) 
i-1 
KO 
— = | f (æ) sin ng da. 
xf, 


We finally have, on summing over ¢ from 1 to 7): 


ag = — Z fein nal? [aP +0) — f(a — 0) +...+ 


+ sin na? [ f(x? + 0) — f(a — 0)]} -— a f’ (x) sin ng da, 


-7 


where x0) = —z, x0) = +2, and by the periodicity of f(x), fae +0)= 
= = f(a + 0). In the ‘present case sin na) = 0, but we preserve the term for 
the sake of the symmetry of later expressions. 

For brevity, we denote the jumps of f(x) at its points of discontinuity «©, 
aO, n.a, s9, as respectively: 


8) = f(z +0) — fle —0);..., 3 = fla + 0) ~ fa —0). 
The formula above can now be written in the form: 


ee > (0: (0) — 52 
an = og sin NTI , ( ) 


where a, and b; denote the Fourier coefficients of the derivative f'(x). Similarly, 
on departing from the expression: 


cos nr 


fiw sin ng da = — ———— f(x) + Ir (x) cos ng da, 
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we obtain: 


a, 


l t 
b = Ü) 0 
n an A coana pr 


[S 


n ~ 
a (53) 
Expressions (52) and (53) are important in themselves, since they show 
that if a periodic function f(x) has jumps, ite Fourier coefficients are of order 
l/n as n— œ, the main parts of the coefficients a, and b, being respectively 

equal to: 
1 Ta To 
T he (0) gi (0). ecw M o (0) =A) 
oo a 6 sin na; = Pe cos ngt, (54) 

whilst their remainders are of higher order than \/n. 
The remainders are in fact: 
_ bn an, 
n’ n’ 
and since ap, bp are the Fourier coefficients of f’(x), they tend to zero as n > co, 
i.e. they become infinitesimals as n — œ. There is another reason for the 
importance of (52) and (53); by using them, we can distinguish in the Fourier 
coefficients a, and bn, which tend to zero as n — co, the components of different 
orders of smallness with respect to 1/n. 

To do this, we let a, b{ denote the Fourier coefficients of the kth 
order derivative f(z), whilst 6, ...., 6 are their jumps at the points 
a) een 

Dy eeens Dip $ 


J — fO (aH 4-0) — P (aF — 0). = O (a +0) — fO (a0). 


We apply (52) and (53) to coefficients a;, bp, which merely requires the 
eplacing of f(x) by f'(x), ô% by 6), x by a), and t, by 7,3 we get: 


A 1 ¢ a) (1) bn 
an = — — X OP sin nz —, 
an f n 
(55) 
, LS ww w, an 
b — OP cos ngi! —— 
n an Pa i ie aye? 
where ap, bp are the Fourier coefficients of f’(x). 
Repetition of the above argument gives us: 
la p” 
_ (2) sin ga) — On 
an=- za = 6? sin naf na 
i=l 
56) 
E ai 
bh= — Y 82 cosnx® 4 
n an È, i E Tig 


If we write for brevity: 


Tk Tk 
Ak= +3 6 sin na; B; = ate > óP cos na? (k =0,1,2,...), 
t=1 


T izl 
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we have from the above expressions: 


A B A -B Ok 
sails ~ m + m + T Serek a a 
B A k pa 
B A, 2 gk 
bn 7 n? po ga A nk? 


where Ops g, have different expressions depending on the form of the number 
k, as given in the following table: 


k 4m |4m-+]|4m-+2/4m-+ 3 


, 4 k 
ek | an? |— ba | —an? | bP 
ok | OY)? | aP | DP | — a? 


Here, a‘*) and b% are the Fourier coefficients of f(z). 

It is clear from the expressions for A, and By that these are dependent on 
n; however, n only appears under the sign of the trigonometric function, so 
that with fixed s, A, and B, remain bounded. The coefficients of the trigonometric 
functions in A, and B, consist of the jumps of the derivative f(x). If there 
are no jumps, A, = B, = 0. On the other hand, if f(x) satisfies Dirichlet 
conditions, the factors 9, and @;, which coincide except for the sign with one 
of the Fourier coefficients of f(x), will be of order not lower than 1/n for large 
n, since we saw from [151] that the Fourier coefficients of a function that 
satisfies Dirichlet conditions are of order not lower than 1/n. We thus obtain 
the following theorem: 

If a continuous periodic function f(x) has continuous derivatives up to and includ- 
ing the (k — 1)-th order, whilst the k-th order derivative satisfies Dirichlet conditions, 
the Fourier coefficients a, and bp of f(a) are of order not lower than Unk+, i.e. 


M `. M 
lanl < eee 5 [bnl < eer 


where M is a positive number. 

It may be remarked that, for k > 1, the Fourier series for f(x) is uniformly 
convergent. It follows in fact from the theorem proved that in this case a, 
and 6, satisfy the inequalities 


M. » M 
lanl < a 3 lbn] <a ’ 


whilst we can write for the general term of the series: 
2M 


ja, cos nx + bp sin naj < P 


whence follows the absolute and uniform convergence of the series, since 


l/n? is convergent [I, 122]. 
i g 
n= 
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Formula (57) remains valid for Fourier expansions in the interval (—I, l). 
We merely have to put 


Ik] R -Annat 
A, = (=) -— J ôP sin a 


M4 m 
en (k =0,1,2,...) (58) 
l TAE y nart 
= | — aa (*) ed 
By (=) = Phe cos —T* >» | 


whilst the expressions written in the table for @,, 0% have to be multiplied by 
(I/x)*, whilst now 


84 =O 0+0 — {9 U0) = #9 (— 14-0) —fO(— 1-0)... 69) 


158. Improving the convergence of Fourier series. As we saw above, the 
presence of terms of order 1/n in the expressions for the Fourier coefficients 
an and bn of f(x), which cause the slowness of convergence, are due to the jumps 
in f(z). However many derivatives the function may have in (— x, x), it only 
needs one jump at the end of the interval, or to be precise, distinctness of the 
limits f(-- z + 0), for its Fourier series to be unsuitable for practical purposes. 
Furthermore, the important thing in applications is often not the investigation 
of the expansion of f(x) itself but that of its first, second, or even third order 
derivatives. If the coefficients of f(x) itself are of order ntt, differentiation of 
the series brings the coefficients to the order 1/n*, as is clear from the equations: 


f(z) = “ + 5 (a, cos ng + bp sin ne), 
n=1 


[ (x)= X nib, cos nx — ap sin ng), 
n=1 
(2) = X n? (— ag cos nx — b, sin nz). 
n=l 


Conversely, the order of the coefficients is increased by unity with each 
integration, since 


[2 (an cos nx + bysin nz)dr =0 + X T bn COs ng F an sin na A 
n=1 


n=1 n 


where C is an arbitrary constant. 

The convergence of a Fourier series thus deteriorates on differentiation; 
if say the coefficients of f(x) are of order 1/n?, as is the case for f(x) continuous 
and periodic, where f'(x) may have discontinuities, the series obtained for f’(x) 
by term-by-term differentiation will have coefficients of order 1/n, whilst the 
series for f”(x) becomes quite meaningless, inasmuch as its coefficients do not 
even tend to zero. It may happen that the Fourier expansion of f(x) is entirely 
unsuitable for evaluation of its derivatives for any value of x, this being the 
case for an f(x) which, whilst possessing derivatives of any order at all other 
points, lacks a derivative merely at a single point. 
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The problem therefore urises of improving the convergence of Fourier series, 
i.e. of transforming them to new series in such a way that the new coefficients 
have a high enough order of smallness for the deterioration in convergence 
following on differentiation not to prevent the evaluation of derivatives. For 
instance, if we want to be free to evaluate the derivatives up to and including 
the third order by term-by-term differentiation, we want the original series to 
have coefficients of order not lower than 1/n®, since we then get coefficients of 
order 1/n? in the series for the third order derivative, this latter series being 
uniformly convergent and suitable for practical computations. 

We can improve the convergence of the Fourier series for f(z) as follows. 
Let there be terms of order 1/n in formulae (57), i.e. f(x) has jumps 6). 

A simple auxiliary function ¢,(z) can always be found, having the same 
jumps as f(z). The difference: 


f(z) = f(x) — p (2) 


now has no jumps, and its Fourier expansion S(f,) has coefficients of order at 
least 1/n?. The simplest choice for p(x) is the function whose graph is a step-line 
i.e. consists of a number of lines either parallel to OX, or with say constant 
slope m,; in the first case we have: 


Yo (t) = 0, ie. f(z) =f’ (x), 


fi (2) — F (2) = — my 


so that f(x) has the same jumps as f’(z). 
We take p(x) as having been defined in some such way. We have: 


Ha) = Po (2) + fi (2), 


where ¢,(x) is a known, extremely simple function consisting of sections of 
parallel straight lines, whilst f,(z) has a Fourier series with coefficients of order 
not lower than I/n?. We now improve f,(z). We have: 


I (2) = fi (2) +m. 
On carrying out the same process on f'(x) as above on f(x), we can write 
fi (£) = fs (£) + pı (x), 


where 9,(x) is a function consisting of pieces of parallel straight lines and f(x) 
has a Fourier expansion with coefficients of order not lower than 1/n?. Integra- 
tion of the last equation gives us an expression for f,(%), and hence for f(x), as 
the sum of a Fourier series with coefficients of order not lower than 1/n? and 
pieces of second degree parabolas. If we were to undertake the further improve- 
ment of f’(x), we should get an expression for f(z) as the sum of a Fourier series 
with coefficients of order not lower than 1/n* and pieces of third degree parabolas, 
and so on. 

The above method is chiefly used when the function is unknown and only 
its Fourier series given, with coefficients of the form (57). In this case, we 
have to find the points of discontinuity and the jumps of f(z) and its derivatives 
from the form of the coefficients, and afterwards apply our method for im- 
proving the convergence. 


whilst in the second: 
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An alternative approach is as follows: we can sum the parts of the Fourier 
series which derive from the first terms of formulae (57) for the coefficients 
an and bp. It is with these terms that the poor convergence of the Fourier series 
is associated. The Fourier series remaining after summation has better con- 
vergence than before. 

The following formulae must be used for the summation mentioned: 


+ (-n-2) (— 2xn<a2<0) 


wr Sin nx 
= 60 
a n ja~e (0< a < 2z) (601) 
0 (x = 0 and x = + 27) 
: 2m? + 6 3x? 
ee [ Patch Saw Be once <0) 
—— = 60. 
2 n? Qn? — bax + 32? we 
Qn? 2 3 
€, sinng = a =e Paes 
——— = 60 
2, n? Qa? 2 — 3am? -p 23 oe 


The first formula is obtained by expanding in sines the function (z — x)/2 
in the interval (0, x). The second is obtained by integrating the first with 
respect to x from 0 to x, whilst making use of the equation [144]: 


n? 


S41 
2 8 


Similarly, the third formula isfound by integrating the second. Further integral 
tion could give us further formulae of the above type. We assume here. that the 
interval has a length equal to xz, which can always be arranged for by a simple 
transformation of the independent variable. 

The above idea of improving the convergence of the Fourier series by gradual 
revision of f(z) and its derivatives, as also the example below, is due to Prof. 
A. N. Krylov.t 


159. Example. We consider the Fourier series: 


na 
g 2 "Cos -5 
(OST 2. wry inne (O<x<n). (61) 
We have here: 
TT 
2n cos 
btm, Ho ee 
c a(n? — 1) 


t O nekotorykh differentsialnykh unavneniyakh matematicheskoi fiziki. 
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In order to represent b, in the form (53), we expand the fraction 


n 
n? —i 


in powers of 1/n, as far as terms of order 1/n}: 


n 1 1 l 1 
n?— J n + n3 oe nm 1 
Lr 
n 
and 
2 cos — 2 cos ZZ 2 cos = 
r Zo 2 2 (62) 
g an an? an? (n? — 1) 
We thus have to sum the two series: 
cos Par sin ng cos = sin nz 
22 EJ 22 “Qe 
a > n and — OA ns : (63) 
n=l n=1 


We let S,(x) denote the first sum and rewrite it as: 


Formula (60,) can be applied to both these sums. We start with the first 
sum. As x varies from 0 to z, the argument (x + 2/2) varies from 2/2 to 
32/2, and (60,) gives: 


3 n n 
6 sinn (2 +3] i a— (2+5) ecg ee 
2 n a 2 4r ( i 


aim 


As regards the second sum, as x varies from 0 to 2/2, the argument (x — 2/2) 
varies from — 2x/2 to 0, whilst as x varies from 2/2 to 2, (x — n/2) varies 
from 0 to 2/2. 

Formula (60,) gives in this case: 


: n 
œ sinn |z — -y 


| 
EPs a : T (5 < z <a) 
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Addition gives us the following final expression for S,(x): 


n 
— (o<2<5) 
cos 22 sin ng 
3 2 2 2 7 rn (F<2<a] (64) 
PAG = =z ae E n 2 
n=] 


e (=F) 


We could evaluate the second of sums (63) by applying (60,), but a different 
approach may be used. Let this sum be denoted as S,(x). It is easily seen that 
integrating S,(x) twice with respect to x gives us —S,(x) to an accuracy of 
a first degree polynomial. On integrating (64) twice, we obtain: 


3 ass 
j (o<2< 4); (x ~ m8 ($ <e<a] 


62 2 62 
and consequently: 
3 $: + 
-$ +0ie +0; (o<2<4) 
S, (a) = i (65) 
ec — na n n n 
Seger ae tO: ($ <e<a]. 


We remark in regard to finding the constants C that the Fourier series 
for S(x) has coefficients of order 1/n8, whilst the series for S;(x) has coefficients 
of order 1/n?; thus both series are uniformly convergent to functions continuous 
at x = 2/2. It follows that the two expressions (65) must coincide, as must 
also their derivatives, at x = 7/2: 


m3 JEN , m3 „n a ” 
aBa Org HOS ggg OU eas 


n? , n? n 
Sa +0, = sa +0. (66) 
Moreover, it follows from the form of the second of sums (63) that S,(0) = 
= §,(x) = 0, which gives by (65): 
0,=0; n+ =0. (67) 


We can obtain all four constants from equations (66) and (67): 


, „n _ 7 m 
Cy 0i = g> C,=0; =~ -37> 
and on substituting in (65), we get for S(x): 
x3? n n 
ta + aE? (<< 5] 
S; (£) = 
(x — 1) n 


g ra T (Z <<): 
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We finally obtain the expression for series (61): 


nr 
2 2 cos —— 


f(z) = 8, (2) + 8, (2) - > OS -a mi) sin nz , (68) 
n=1 


which solves our problem. The runction f(x) is given in terms of known functions 
S,(x) and S,(x) consisting of pieces of straight lines and parabolas, and of a 
Fourier series with coefficients of order 


baat +25, 0 
n(n? — 1) ’ ns 


We are now free to evaluate the derivatives of the first- three orders of 
f(x), whereas it was impossible to differentiate series (61), which is itself non- 
uniformly convergent. 


§ 16. Fourier integrals 
and multiple Fourier series 


160. Fourier’s formula. We conclude our treatment of Fourier 
series by considering the limiting case, when the interval (—J, l) in 
which the series is investigated tends to (—°, +°), ie. l> +00, 

Let f(x) satisfy Dirichlet conditions, be continuous in any finite 
interval, and furthermore be absolutely integrable in (—°°, +20), i.e. 
there exists: 


+o% 
f |f (x) [de = Q. 


We have by Dirichlet’s theorem inside (—J, l): 


NIL } 


] (x) =F +S (2,008 + by sin i 


On recalling that 


+1 + 

I at 1 A ze 

an= 7 | f(t)oos "de, b= fhe sin ~~ dé, 
-l -i 


we obtain from this: 


w E 


41 l 
Ha) =a | 1 at-+ + f (0) cos = 2) ay, 


=l <1 
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What happens to this formula when l —> +o? The first term clearly 
tends to zero, since 


+1 i 41 l sh 
a 10a] <ar [lat < gy fO go. 
zi 2 J 


On introducing a new variable a, which takes equally spaced values 
in the interval (0, œ): 


a= 


afa 


with each time the increment 4a = xjl, we can write the remaining 
sum in the form: 


+! 
ae 4a { f(t) cosa (t — x) dé. 
@ i 


For large J, the integral under the summation sign will only slightly 
differ from 


i f (é) cosa (t — x) dé, 


and it can be foreseen that the whole sum will tend to the limit 
pL 
= { da | FO cosa(t— 2) de, 
6 -> 
as l — +c, so that we have: 
+œ +00 


f(a) == f da f F0 cosa ¢ — a) at. (1) 
0 -o 


It is only necessary to replace f(x) at points of discontinuity, if such 
exist, by 


f@+O +E (C—O) | 
2 


This expression, obtained when /-»>-- °° in the Fourier series, is known 
as Fourier’s formula. We have now arrived at the following proposition: 
if f(x) satisfies Dirichlet conditions in any finite interval and is absolutely 
integrable in (—°°, + °°), we have the equation, valid for all x: 


w -+œ 
= f da | f cosa(t— z)dt = EHI Fie., (2) 


0 -œ 
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The above is generally known as Fourier’s theorem, whilst the inte- 
gral on the left-hand side of the equation is the Fourier integral of the 
function f(x). The above discussion is not rigorous, though it can be 
made so with the help of certain auxiliary arguments. Instead of 
proceeding in this way, we offer an alternative rigorous proof, based 
on the results of [152]. 


Equation (2) will be proved if we can show that 


a +œ 
lim = faa f 1@cosae x) dt PAYT LED 
0 — o 


Jara A 
If J(4, x) denotes the integral on the left, we can write: 
+œ À 
Ja) =Z fioa [ cosa ada, (3) 
ae Ò 


i.e. we can change the order of integrations with respect to ¢ and a. 
This follows from the fact that, by the absolute integrability of f(x), the 
integral 


+œ 
f fÍ (t) cos a (t — x) dt (4) 


is uniformly convergent for all values of a. In fact, the integrals 


N’ -N 
f j (t) cosa (t — æ) dt, f j (t) cosa (t — a) dt (N < N’)... (5) 
N —N’ 


do not exceed in absolute value 
N’ 
f i76 de, (6) 
N 


and consequently, for a given € an N, can be found, independent of a, such 
that for all N and N’ > N, integrals (5) have absolute values less than e, this 
being a property of integral (6), by the absolute integrability of f(t). 

But now integral (4) can be integrated with respect to the parameter a 
under the sign of the integral [84], which gives us: 


2 + co + 00 A 
J, x)= f da f FE cosa (¢ — 2) a =z fioa f cosa t — 2) da. 
0 — o% -w i) 


The inner integral with respect to a on the right-hand side of (3) can be evalu- 
ated directly, which leads us to: 


thes a 
raat fro THEA a, 0) 
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so that it remains for us to find: 


fe l frm SPE? a. 


A> oo 


On splitting the interval of integration (— co, + co) into the two intervals 
(— œ, x) and (x, + œ), and replacing (t — x) by (—z) in the first and by z in 
the second, we can write (7) in the form: 
de +— |@e 

To 


sin hz sin Liz g 


Jaa) = F fie) 
0 


Both these integrals have the form of Dirichlet integrals, except for the infinite 
limits. In spite of this latter fact, they are easily shown to have the properties 
of ordinary Dirichlet integrals, i.e. we must have as 4 — oo: 


œ 


L fje- EE gyre 
f (8) 


sin Az 


dz ++ f(e +0), 


l foo] 
zhet 
0 
whence it follows that in fact: 


(age hee. 


which proves Fourier’s theorem. 

We still have to prove (8). We shall confine ourselves to proving the first of 
the expressions. Let e be any given small positive number. With z > 1, the factor 
(1/z)sin åz has an absolute value less than unity for any real A, whilst the function 
f(x — z) is absolutely integrable in (0, œ) by hypothesis. We can therefore find 
an N > 1 such that, for any å: 


lifre- me al <t fine—soiae <§- 
N 


If we take the Dirichlet integral with finite limits: 


N 


5 [ie 22) 
0 


sin Az dz, 


we can say that it tends to f(x—0)/2 as A — œ, i.e. for all sufficiently large A: 


sin Az 


ne dz — Sf (@—0) Sk 


Bi f(a — 22) : 
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We obviously have: 


1 fje- BE ayie- 
0 


N œ 
"SE 
0 N 


whence, by virtue of the last inequalities, we have for all sufficiently large A: 


o z 2 
ifie- witu njata 
0 


In view of the arbitrary smallness of €, this gives us the first of expres- 
sions (8). The proof of the second expression is precisely the same. 


A transformation of equation (2) is possible if f(z) is an even or odd 
function. 


We havein fact, on expanding cos a(t — x) as the cosine of a difference: 


f(z+0)+f(z—0) _ 
2 


ao 


+œ +o 
= = da| ro cos at cos ax dt + fro sin at sin ax dt], (9) 
0 — o — o% 


where both integrals with respect to ¢ clearly have a meaning in view 
of the absolute integrability of f(t) in the interval (—°°, +00), 

If f(t) is even, f(t) cos at is also even, whilst f(t) sin at is odd, and 
consequently: 


f f(t) cos atdt = 2 fro cos at dt, 
0 


+0 
f f@ sin at dt = 0, 
so that 7 


L 
2 ae 


cos ag da f f (£) cos at dt. 
ö 


On the other hand, if f(x) is odd, we obtain in a similar manner: 
Iæ+0+/e-0_ 2 f 
2 m 


sin az da f f (t) sin at dt. 
0 
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If f(x) is defined in (0, œ) only, it can be contined into the neigh- 
bouring interval (— °°, 0) in an even or odd manner, in which case we 
get two expressions for the same f(x), assumed for simplicity to be 
continuous: 


f(z) = f cosazda | f(t) cos at dt (x > 0), (10) 
0 0 

f(z) == f sinaxda | f (t)sin at dt (x > 0). (11) 
0 0 


It need only be borne in mind that with the first of these the f(x), 
contined evenly, gives a continuous function of x, so that the first 
expression is also valid for x = 0; whereas we get a discontinuity with 
the second expression if f(0) # 0, the right-hand side being equal to 
zero, and not /(0), at x = 0. 

The first integration in (9) is with respect to ¢, and on introducing 
the two functions 


+o +o 
A (a) == | f(t) cos at dt; B (a) =q Jf) sina de, 


we can rewrite (9) in the form: 
f(z) = f [A (a) cos ax + B (a) sin ax] da, 
0 


(x) being assumed continuous for simplicity. This latter expression 
gives us the expansion of f(x) in the infinite interval (— œ, +00) 
into harmonic oscillations with frequencies a varying continuously 
from 0 to +2; the functions A(a) and B(a) give the amplitude distri- 
bution laws and the initial phases in relation to the frequency a. For 
the finite interval (—I, 1), we had the frequencies a, = nz/l (n = 0, 
1, ...), forming an arithmetic progression. 
If in (10) we put 


(a) = JZ J ro cos at dt, (12,) 
0 
we can now write (10) as 


f(a = /2 Í fı (a) cos az da. (123) 
ò 
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In these two formulae, f(x) and fı (a) are expressed in terms of each 
other in precisely the same way. 

If we take f(x) as the given, and f, (a) as the required function in 
(12,), this expression now represents a so-called integral equation for 
f/,(a), inasmuch as this latter function appears under the integral sign 
(Fourier’s integral equation). Equation (12,) gives the solution of this 
integral equation. Similarly, we can write (11) in the form of the two 
expressions: 


f(a) = |/2 f rosin ave (13,) 
0 
zi 
f(x) = a f (a) sin ax da, (13,) 
lef 


Examples. 1. In expression (10), we put 


1 for O<@e@<l 
f(z) = 
0 , we>1 


We now get for the integral on the right-hand side of the equation: 


œ 1 
[asaji (t) cos aż dt = Í cos ax da | cos atdi = í 
0 0 

and 


cos az sin a 
da, 


0 
consequently 
| 1 for 0O<a<1 
2 [ een da = es 
m, a 2 
2 | 0 forz>1. 


2. On setting in (11) 
Í (2) = e™® (B > 0), 
we have on the right-hand side the integral: 


œ œ 


= f sinazda [ o= sin at at = Ž js 
n J a. 
(0) 


and we thus get 
e ee for z>0 


———, da = 
2 2 
aaia 0 for x= 0. 
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3. Similarly, on setting in (10): 


Í (a) = e F* (8 > 0), 
we find: 


a + Bo = 2B 


oo 
COS ax i 
f TaD 


0 


Fourier’s formula is often written in the complex form: 


+o +o 
= 1 i i 
Le TOE es | da | f (t) et dt. (14) 


— 0 


This is easily obtained from (2). If we substitute under the integral 
in (14): 
etx)! — cosa (t — x) + i sin a (t — 2x), 


we get the two integrals: 


+a 


+o +o +o 
ge f da f #0 cosa (t— x) at and al da Í f(t) sin a (t — æ) dt. 


In the second of these the variable a appears under the sine, so that 
the integrand is an odd function of a, and we consequently get zero 
on integrating with respect to a over (—Ħ2, -++ °°). Conversely, we have 
an even function of a in the first integral, and integration over (—°, 
-+°o) with respect to a can be replaced by integration over (0, °) 
witb the factor 2 written in front. Hence it is clear that (14) is equi- 
valent to formula (2). 

Assuming continuity of f(x), we rewrite (14) as 


+00 


+o 
fe) = 5 f eda f fi edt, 


whence it is evident that, as in the case of (10) and (11), it can be 
written in the form of the following two expressions: 


+œ 


fila) =z |0 edt, (15,) 


-œ 


+ 00 
f(x) = = fh (a) e-i da. (15,) 
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We notice one point in connection with the complex form of the Fourier 
integral. We cannot say that 


i + 0 +a 
gjd f Osina — a) ai 


— 00 


with infinite limits with respect to the variable a [82] has an ordinary meaning. 
We can only say that, for any finite positive M, 


a ae 
x | { f@sina@—a)a=0, 
-M =o% 


and we should therefore strictly write Fourier’s formula in the complex form as 


+M +o 
feo tim [ee da f f(t) o dt. 
-M -0 


2x M--+0 


Here, the lower limit tends to (— œ) and the upper limit to (+ co), whilst 
both have the same absolute value. A necessary condition for the existence 
of the improper integral in the ordinary sense is that a limit exists for any method 
of the lower limit tending to (— œ) and the upper to (+ 9). 


161. Fourier series in complex form. The method just described for 
writing the Fourier integral in complex form can likewise be applied 


to Fourier series. 
We recall the formulae of [146]: 


k sa k 
f(a) => + (a. cos == + b, sin =), (16) 
1 kad ly kat 
as = + | (foo dé; b= > f f (E) sin = dé. 
a 
We show that these are equivalent to the following: 


na 
T dé. (17) 
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Here the subscript n takes negative as well as positive integral 
values. We define separately Co c, and C., where k is a positive 
integer. We have by (17) and (16): 


i +1 
omg | ied = 


=~ Or TO ) (cos 4 — i sin 7È) ag = 2t, 


c=- Tie (cos = + isin) daft eee ‘ 


On substituting in series (17) and summing separately over the 
positive and negative subscripts, we get: 
ae Pra ka uai ib, oir 


f+ paga I 


Terms in the same k in the two sums written are complex conjugates; 
on combining the pairs into single terms, we get real quantities: 


f(@z)= y+ 25° 


= A, cos #72 + 6, sin r, 


apg — iby of aes en FE 
Se T 4M 
and the above expression for f(x) coincides with Fourier series (16) 
from which the equivalence of (17) and (16) follows. 


162. Multiple Fourier series. Fourier series and integrals can also 
be used for representing functions of two or more independent variables, 
For instance, let f(x, y) be a periodic function, of period 21 with respect 
to x and 2m with respect to y. On considering f(v, y) as a function of x, 
we have [161]: 


+œ i onx 


f(x, y) = = cy (Y) € T, (18) 


where 
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where 
i +m pran 1 +H 4m 2 
coma [eine Ranca | Srem oD agan. o 
-m -l -m 


Substitution of the expression obtained for cs (y) in (18) gives us 


to P alo Mier 
? 


fizy = > > on m 


O= 0 T= -0 
whence we obtain, on removing the brackets: 


jæ = S opo”), (20) 


0,tT=—~—a 


which is the generalization of the Fourier series for the case of two 
variables. 

Similarly, we have for the periodic function f(z, £», £4) of three inde- 
pendent variables, of period 2m, with respect to 2,, 2, with respect 
to x, and 2w, with respect to z,: 


res ia (751 A gars 
Í (Er, Xa, 2a) = = Coroso ® ML 9a 8", (21) 


Oi, O, 0 =— 0 
where 
w, O, Ws 


Conois T mana f f f FEL Èz $3) e 


—D, — Wy —Wy 


Sir (24% ans 


+ sis) dé, dé, dé. 
(22) 


On separating out the real parts of (20) and (21), we get the real 
forms of the Fourier expansions. 
Series (20) will now take the form, when l = m = x: 


f(a, y) = PA (tem cos na cos my + af m cos nx sin my + 


+ a®),, sin nx cos my + at), sin na sin my). 


We shall not write down the expressions for the coefficients or investi- 
gate the conditions for f(z, y) to have a Fourier expansion. We shall 
merely note the sufficient condition for expansion to be possible at a 
given point (2, Yo): (1) bounded partial derivatives 0f/dx% and df/ay 
exist everywhere; (2) the mixed derivative 3? //drdy exists in the 
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neighbourhood of the point (£o, Yo), at which it is continuous. Fourier’s 
formula has the form, for functions of two variables: 


Me.) = or f da, Í dé l da, I f(E, n) e'lat-9+aat -3 dy, (23) 


where the integrations with respect to a, and a, have to be carried out 
as indicated at the end of [160]. We have in the real form: 


C] œ œ +o 
x,y) =< | do, f dë f do, f #8, 1) cos a (Ẹ — 2) cos as (1 — y) dn. 
0 -œ 0 r] 
(24) 


This formula is valid if the function f(x, y), defined throughout the 
plane, is continuous, has first order partial derivatives, is absolutely 
integrable with respect to x in —co < x < +œ for any fixed y, and 
is absolutely integrable with respect to y in —œ < y < +0 for any 
fixed v. 

If, for instance, f(x, y) is an even function of x and y, we can write 
instead of (24): 


= =a] COS a, 7 aajo aé af cos a,7 da, jie, n) cos an dy. (25) 
0 ò 


n? 


Similarly, Fourier’s formula can be written for a hneig Flts La, +5 
Xn) of any number of independent variables. 


CHAPTER VII 


THE PARTIAL DIFFERENTIAL EQUATIONS 
OF MATHEMATICAL PHYSICS 


§ 17. The wave equation 


163. The equation of vibration of a string. The problem of integra- 
ting partial differential equations belongs to one of the most difficult 
and extensive sections of analysis and we confine ourselves to the con- 
sideration of certain particular problems in this field. The present 
article is devoted to problems connected with the so-called wave 
equation: 


ug ( Ou Ou 3u Ou z 
d (ae + aye t ar) a ane 
where 
eu Ou Ou 3 
Au = Ox? + age + gr = div gradu. 


We arrived at this equation in [116] and [118] when considering 
acoustic and electromagnetic oscillations. We suppose that u does not 
depend on y and z, i.e. that u has the same value at all points of a 
plane perpendicular to the v axis. In this case, the wave equation takes 


the form 
ug Pu 
3t? 3x? ’ 


the wave concerned being generally referred to as a plane wave. We 
now show that the same equation is obtained by considering the 
smali transverse vibrations of a taut homogeneous string. 

We understand by string a thin thread which can bend freely. We 
take it as acted on by a tension T, and directed along the z axis when 
in the equilibrium position with no external forces (Fig. 127). If we 
move it from the equilibrium position and moreover subject it to the 
action of a given force, it starts to vibrate, so that a point of the string 
with the equilibrium position N of abscissa x occupies the position M 


476 
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at the instant ¢. We confine ourselves to considering unly transverse 
vibrations, on the assumption that all the motion takes place in a 
single plane and that points of the string move perpendicularly to the 
x axis. We let u denote the displacement NM of a point of the string. 
This displacement is in fact the required function of the two indepen- 
dent variables x and t. 

We distinguish an element MM’ of the string, which has the equi- 
librium position NN’. On the assumption of small deformations, we 
neglect the square of the derivative du/dx by comparison with unity. 
Let a be the acute angle that the direction of the tangent to the string 
forms with the v axis. We have: 


ou 
Ou : tan a Ox Ou 
tana = — and sine = ——— = ———__ w _., 
Ox V1 + tan? a uy? Oa 
1+ (35) 


Let F denote the force acting on the string perpendicularly to the x 
axis, and reckoned per unit length. The following forces act on our 
element MM’: the tension at the point 
M’, directed along the tangent at M’ and 
forming an acute angle with the z axis, 
the tension at the point M, directed along 
the tangent at M and forming an obtuse 
angle with the œ axis, and the force 
Fdz, directed along the u axis. In view 
of our assumption of small deformations, 
we can take both the above tensions as 
equal in magnitude to T. We suppose that Fie, 127 
initially we have equilibrium of the string 

under the action of the force F. On projecting on to the u axis, 
we have the following equilibrium condition: 


Tsina’ —T,sina+ Fdz =0, (1) 


where a’ is the value of the angle a at the point M”, i.e. 


sina’ = ($2) ; sina = Qu 
T Loa Im? = (Zu 


and consequently: 


Tol(Se)ae — (Ge) + Fa = 9. 2) 
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The difference in square brackets expresses the increment of the 
function 0u/dz as a result of x changing by dz. On replacing this in- 
crement by the differential, we get [I, 50]: 


Ou =] _ Ou d 
or (ae m Oat O%: 

On substituting in (2) and cancelling dz, we arrive at the equilibrium 
equation of the string: 


Ou 


Toa + F=0. (3) 


In accordance with d’Alembert’s principle, the equation of motion 
is simply obtained by adding to the external force the inertia force, 
which we find as follows: the velocity of the point M is evidently 
du/dt, and its acceleration 0?u/dt?, so that the inertia force of the 
element MM’, given by the product of the acceleration and the mass 
taken with the reversed sign, is 


where ọ is the longitudinal density of the string, i.e. the mass per unit 
length; the inertia force, reckoned per unit length, is now 
Oa 
0 On? 
g being assumed constant. 
It follows that the equation of motion is obtained by replacing F 
in (3) by F — ee?u/dt?, which gives: 


Oru Ou 
Coe = Toe tF. 


On dividing by ọ and writing 


—. = aè, 


F 
e e 


— =f, (4) 
we obtain the equation of forced transverse vibration of a string: 
Ou 
=a 


Ou 
Oe gt BH (5) 


If external force is absent, we have f = 0, and we obtain for the 
equation of free vibration of a string: 


Ou Ou 
oe =O ae (9) 
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Another method of deducing (5), on the basis of Hamilton’s principle, 
will be found in Volume IV. 

We assumed above that the external force was distributed con- 
tinuously along the length of the string; but occasionally, we are con- 
cerned with a force P concentrated at a single point C. This case can be 
considered, either as a limiting case of the above, the force being 
assumed to act on an infinitesimal element of length « about the point 
C, whereas the product of its magnitude with e tends to a finite non- 
zero limit as € — 0, or else directly, by applying equation (2) to the 
element MM’ about the point C and replacing the Fdx appearing 
there by P. It may be noticed that we do not add to Fdg the inertia 
force (— (?u/at?) 9 dx), since we assume that the latter tends to zero 
as dx — 0. 

On assuming that the ends of the element approach the point C, 
and denoting the respective limiting values to which du/dx tends as we 
approach C from the right and the left by 


(se). (a) 


equation (2) gives us in the limit: 


r|), —()_J=-?.- (7) 


We see from this that the string has an angular point at the point C 
of application of the concentrated force, i.e. a point with different 
tangential directions to the right and the left. 

As is generally the case in dynamics, the equation of motion (5) is 
not sufficient in itself for a full definition of the motion of the string; 
we also have to give the state at the initial instant t = 0, that is, the 
positions of its points u and their velocities du/dt at t = 0 must be 
known functions of g: 


ul =P); Sl = ala). (8) 


These conditions, which have to be satisfied by the required function 
u at t = 0, are known as initial conditions. 

An infinite string may be considered in theory; equation (5) and 
conditions (8) are sufficient for finding the solution here, where g(x) 
and ¢, (x) have to be specified throughout the infinite interval (—o, 
+2). This case can correspond to the consideration of plane waves 
in unbounded space. As we shall see later, the results obtained for an 
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infinite string give us a picture of the distribution of disturbances 
in a bounded string also, up to the instant when disturbances reflected 
from the ends also make their appearance at the point concerned. 

If, however, the string is bounded on one or both sides at the points 
x= 0 and x=1, it is necessary to indicate what happens at the ends. 
For instance, let the end z = 0 be fixed. We must have in this case 


u | x=0 = 0. (9) 
If the end xz = 1 is also fixed, we likewise have: 
u |x=: = 0, (91) 


and these conditions must be fulfilled for any t. 

Instead of the ends being fixed, they may move in a given manner; 
the ordinates of the ends must now be given functions of time, i.e. we 
must have: 

u lezo = %1 (6); Wear = %2 (t). (10) 

Whatever the case, if the string is bounded on one or both sides, a 
condition must be a specified for each end, this being known as a 
boundary condition. 

We see from the above that the initial and boundary conditions have 
as great an importance as the actual equation of motion for the solution 
of a concrete physical problem, and moreover, that what we are inter- 
ested in is not so much finding arbitrary solutions or even the general 
solution as finding those solutions which satisfy the given initial and 
boundary conditions. 


164, D’Alembert’s solution. In the case of the free vibration of an 
infinite string, the required function u(z, t) must satisfy equation (6): 


Fu ga aes 


ot ~ © ga 
with initial conditions (8): 


0 
theo=P(2)s a) _ =A), 


where g(x) and q; (x) must be specified in the interval (—Ħ, + °°) 
since the string is infinite. 

The general solution of (6) can in fact be found, and in such a form 
that conditions (8) can easily be satisfied. 

For this, we transform (6) to the new independent variables: 


E=xv— at, n=x+at 
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or 
l 1 
=s(n+&); t= s(n—§8). 


On taking u as depending on x and t indirectly via ¢ and 7 and using 
the rule for differentiating functions of a function, we can express the 
derivatives with respect to the first variables in terms of the derivatives 
with respect to the new variables: l 


Ou ðu du, Ou (ðu Qu) 
te ty) ale) 
On applying these formulae a second time, we get: 
fu ð (ðu Ou ð (3u ðu Ou Ou 3u 
get = ar (ae + m) + ay a + m) E t a t R’ 
atu oa? (u Ou) gð (Ou Wu) _ fe g u y du 
C (3; aE) a- (a — ae) = e (ae a + d 
whence 
ou Ou tu 
“ae — Oo = — 4 ag, 
and equation (6) is seen to be equivalent to the following: 
Ou 
BE oq = 0. (11) 


We conclude, on re-writing (11) in the form 


seo 


that du/dé is independent of 7, i.e. is a function of & only. If we write 


ðu 
ae = 98), 
integration gives us: 


u = f0 (ë) dE + 4 (n), 


where 0, (n) is an arbitrary function of ņ (the “constant” of inte- 
gration with respect to € can depend on 7). The first term can be 
reckoned here as an arbitrary function of £, since 0(&) is an arbitrary 
function of £; on writing the first term as 0, (£), we have: 


u = 6, (È) + 4 (1), 
or, on returning to the old variables x and t: 


u (x,t) = 0, (£ — at) + 0, (x + at), (12) 
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where 0, and 0, are arbitrary functions of their arguments. This is 
the most general solution of (6) and is known as d’Alembert’s solution; 
it contains two arbitrary functions 0, and 6,. We define these by bring- 
ing in the initial conditions (8), which, in view of the equation 


2E = a [— 0; (x — at) + 0; (x + at)] 


and equation (12), give: 
0, (2) + O,(x) = p(x); — Oi (ar) + 05 (x) = BE) (13) 


or, on integrating and reversing the signs: 


x 


0, (2) — 9, (2) = —+[ pa (2)dz + C. 
0 


We determine the arbitrary constant O by putting x = 0: 
© = 0, (0) — 9 (0). 
We can take C = 0 without loss of generality, i.e. 
6, (0) — 6, (0) =0, (14) 
since, if we happened to have C + 0, we could replace 6, (x) and 6, (2) 
by the functions: 


6, (2) — S 


F? 0, (x) 


g 
2 , 


so that (14) is satisfied whilst equations (13) are left unchanged. 
Thus we have: 


0, (2) + O,() = (a); O (2) — blr) =— 4 fo dz. (15) 
0 


From these, we can easily find 0, (x) and 0, (x): 


x 


8 (2) = 3 9(8) a | Pale) des Bala) = gP + a fa dz. (16) 


On subsituting the expressions obtained in (12), we find: 
; 1 x—at 
u (x,t) = > p (% — at) — => Í pı (2) dz + 
6 


x+at 


+o(etat)+s— | aede, 
0 
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or finally: 


x+at 
u(i = PERS PEED 1 fawd an 


x—at 


Expression (17) clearly gives a twice continuously differentiable 
solution (the so-called classical solution) of the problem, if g(x) has 
continuous derivatives p(x) and ”(x), whilst p, (x) has a continuous 
derivative gi (x) for —2 < x < +2. It quite often happens, however, 
that we are concerned. with problems in which the initial disturbance 
is specified by functions which do not satisfy these conditions. For 
instance, if the string has a polygonal form at the initial instant, p(x) 
lacks a definite derivative at the vertices of the polygonal shape. 
Nevertheless, it is reasonable to suppose that (17) gives the solution 
of the problem in spite of u(x, t) not having continuous derivatives up 
to the second order everywhere. The problem is said to have a general- 
ized solution in this case. The theory of generalized solutions is dealt 
with in Volume IV. 


165. Particular cases. Formula (17) gives the full solution of the 
problem considered. It becomes more clearly comprehensible on dis- 
tinguishing various particular cases. 

1. The initial impulse is zero, i.e. the initial velocity of points of the 
string is zero. With this condition, p, (x) = 0 and (17) gives: 


u(x,t) =Le rere (18) 


together with, at the initial instant: 


U |t=0 = U (£, 0) = p (2). 


What are the physical interpretations of solution (18)? The numer- 
ator in (18) consists of two terms, and we shall dwell on the first, 
p(x — at). 

We suppose that an observer, departing from the point x = c of the 
string at the initial instant £ = 0, moves in the positive direction of OX 
with a velocity a, i.e. his abscissa changes in accordance with the 
formula: x = c + at or x — at = c. For such an observer, the displace- 
ment of the string, defined by u = g(x — at), will always remain con- 
stant, equal to ọ(c). The actual phenomenon defined by the function 
u = p(x — at) is known as the direct wave propagation. Returning to 
d’Alembert’s formula (12), we can say that the term 6, (x — at) gives 
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the direct wave which is propagated along the positive direction of OX 

with velocity a. Similarly, the second term 0, (% + at) defines a 

vibration of the string, the disturbance from which is propagated 

with velocity a in the negative direction of OX, in the sense that a 

point with abscissa c — at at the instant t will have the same deviation 

u as the point x = c had at t = 0. We call the corresponding pheno- 

menon the reverse wave propagation. 

The quantity a is the velocity of propagation of the disturbance or 
(transverse) vibration. We can see from (4) that 

=|=, (19) 

i.e. the velocity of propagation of transverse vibrations is inversely pro- 

portional to the square root of the density of the string and directly pro- 

portional to the square root of the tension. 

y j Solution (18) obtained above represents 

& the arithmetic mean of the direct wave 

gy(x—at) and the reverse wave 9(x-+at) and 

z4 0 ok X can be obtained as follows: we draw two 

Fia. 128 identical samples of the graph u = g(x) 

of the string at t = 0 and imagine that 

they are superimposed on each other 

then displaced in either direction with velocity a. The graph of the 

string at the instant ¢ is obtained as the arithmetic mean of the 

displaced graphs, i.e. it bisects the ordinates of these latter graphs. 


For instance, let the initial shape of the string be as shown in Fig. 128: 


0 outside the interval (—‘a,a) 
P(x) = w+a for —a<2<0 
—x+a for O<2<a 


Figure 129 illustrates the graphs of the string at the instants 


We take two axes at right angles on the plane: one for the variable x 
and the other for t. The v axis only is drawn in Fig. 130. Every point 
of the plane is defined by the two coordinates (x, t), i.e. every point 
characterizes a definite point of the string x at a definite instant ¢. 
We can now readily find by graphical means the points of the string 
whose initial disturbances arrive at the point x, at the instant tọ. 
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By the above, these will be the points with abscissae 7 + atọ, since a 
is the velocity of propagation of the vibration. We find these points 
on the x axis simply by drawing through the point (£o tọ) the two 
straight lines: 
Aa (20) 
x + at = to + alg, 


the intersections of these with OX being the required points. Straight 
lines (20) are known as characteristics of the point (xo, ty). Along the 


ay EN 
4 N 
4 ` Zz ~ 


-24 ~x BA k 2K 
4a 


AK, JAY 
Wis W P S 
TSK- L-K 2A k A 3A 
a 
Frc. 129 


Fre. 130 


486 THE PARTIAL DIFFERENTIAL EQUATIONS OF MATHEMATICAL PHYSICS [165 


first of these lines, g(x — at) retains a constant value, i.e. the line 
gives the values (x, t) for which the direct wave produces the same 
deviation as at (£o tọ). The second of lines (20) plays the same role 
as regards the reverse wave. We can say briefly that the disturbances 
are propagated along the characteristics. 

We can reveal the following facts on applying the construction 
indicated. 

Let an initial disturbance have been present only in an interval 
(a,, a>) of the string (Fig. 130), i.e. p(x) = 0 outside this interval. We 
confine ourselves to the upper part of the (a, t) plane, i.e. t > 0, which 
alone has a physical meaning, and draw the characteristics of the 
points a, and a, on OX, these being represented by full lines. These 
characteristics divide the total half-plane into sez domains. Domain 
(I) corresponds to those points at which both direct and reverse waves 
arrive at the given instant. Domain (II) corresponds to points at which 
only a reverse wave arrives at the given instant; whereas only direct 
waves arrive in domain (III). Points of domains (IV) and (V) are those 
at which no disturbance has arrived up to the given instant. Finally, 
the disturbance has had time to come and go through points of domain 
(VI), and these find themselves in a state of rest at the given instant, 
This is clear from the fact that if characteristics are drawn through 
any point M of this last domain, they will intersect OX at some point 
x = c outside the interval of initial disturbance, and consequently 
the values of (x + at) = (c) will be zero. Furthermore, if a line is 
drawn through M perpendicular to OX, the lower part of this line, 
which corresponds to the earlier instants with fixed x, will intersect 
at least one of the domains (I), (II) , (IH), whilst the upper part of the 
line, corresponding to later instants, will be situated entirely in domain 
(VI). This remarkable property, of returning to the original state after 
passage of the wave, is not possessed by the string with every initial 
disturbance, as will be seen below. 

2. The initial displacement is zero, and only an initial impulse is 
present. 

Here we get the solution: 

xtat 
ulz, t) = a J v.@ez. (21) 
a 


x—at 
If we write ®, (x) for the indefinite integral of @,(x)/2a, we have: 


u (x, t) = ®, (x + at) — B, (x — at), (22) 
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i.e. it is again a matter of the propagation of a direct and reverse wave. 
If the initial disturbance is confined to the interval (a,, a,), we get the 
same construction as in case 1, with the important difference, however, 
that the displacement in domain (VI) differs from zero and is given 
by the integral: 


a, 
sa | i (2) dz. (23) 

In fact, it follows from the construction of domain (VI) that we 
have for points of it x + at > a, and x — at < a, i.e. the integration 
in (21) has to be carried out over an interval that contains (a,, az). 
But 9,(z) is zero outside (a,,a,) by hypothesis, so that only the 
integral over (a,, a,) remains, and we obtain expression (23) for u(x, t), 
which represents a certain constant. 

The action of the initial impulse thus amounts to points of the 
string undergoing displacements in the course of time that are expres- 
sed by integral (23), after which the points remain without movement 
in the new position. 

Equation (21) can also be interpreted as follows. Let the point z lie 
to the right of the interval (a,, a), i.e. v > a,. The interval of inte- 
gration degenerates to the point x at t = 0, then with increasing ż, it 
extends to both sides with velocity a. It will have no points in common 
with (a, a.) for t < (x — a,)/a, p(z) will be zero, and (21) gives 
u(x, t) = 0, i.e. rest at the point x. Starting at the instant t = (x—a,)/a, 
the interval (x — at, x + at) will overlap with (a,, a), in which 9, (2) 
differs from zero, and the point v starts to vibrate (the instant when 
the front of the wave passes through x). Finally, with t > (w—a,)/a 
the interval (x — at, x + at) will contain the entire interval (a,, az), 
integration over (x — at, x + at) reduces to integration over (aj, a.) 
since p (Z) is zero outside the latter by hypothesis, and we thus have 
now a constant value for u(x,t), given by (23). The instant t = 
= (t—a,)/a is when the rear front of the wave passes across the 
point x. 

We notice some details regarding the general case. It can happen 
in the general case that the direct or reverse wave is entirely absent. 
Suppose, for instance, that the functions g(x) and g, (x) appearing in 
the initial conditions satisfy the relationship: 


¥9(2) + 55] r1(2)de=0. (24) 
0 
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With this, by the second of expressions (16), the function 0, (x) is 
identically zero, and the reverse wave is absent in the general solution 
(12). If we replace the zero on the right of (24) by a constant, 6, (x) 
becomes constant and its value can be absorbed into 0, (x — at) in (12), 
i.e. reverse wave is again absent. Let us return to the example that 
we considered in case 1. Figure 128 gives the graph of the initial 
deviation (the initial velocity is zero everywhere). The last of Figs. 129 
gives the graph of the string at the instant ¢ = tọ consisting of two 
separate pieces. The right-hand piece corresponding to the interval 
(a, 3a) will move to the right, and the left-hand piece to the left, with 
velocity a. But we can describe the later phenomena with t > t, by 
taking t = t, as the initial instant, calculating the deviation u and 
velocity du/6t for this instant, and applying the general formula (17), 
in which it is only necessary to replace t by (t — to) on the right-hand 
side, tọ having been taken as the initial instant. In this case, the 
initial conditions only differ from zero in the intervals (—3a, —a) 
and (a, 3a). In the general case, the disturbances in each of these 
intervals might give both a direct and a reverse wave. Here, however, 
as we saw above, the disturbances say in the interval (a, 3a) give only 
a direct wave. This happens because in this interval, apart from the 
initial deviations illustrated in the last of Figs. 129, velocities are 
excited as a result of the vibration at t = t, such that the reverse wave 
is absent. Similarly, the disturbances over the piece (—3a, —a) do not 
give a direct wave. This phenomenon corresponds to one of the for- 
mulations of Huygens principle. 


166. Finite string. Let the string be finite and fixed at the ends 
and let the abscissae of the ends be x = 0 and g = l. 
In addition to the initial conditions (8): 


Ou. 


th gO) F leg 7 oe) 


where p(x) and q(x) are specified for 0 < x <l, we also have to 
satisfy the boundary conditions: 


Ulpng =O; ulk = 0. (25) 
d’Alembert’s solution (12): 


u (x, t) = 0; (x — at) + 6, (x + at), (12) 
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is of course suitable for this case, but the definitions of the functions 
0, and @, in accordance with (16): 


6, (2) = 40 (2) — 5 >Í pı (2) dz; 
(26) 


3 (2) = z P (2) + 32 sf yı (2) dz; 


meet with the difficulty here that g(x) and ¢, (x), and consequently 
also 6, (x) and 0, (x), are only defined in the interval (0, l) in line with 
the physical meaning of the problem, whereas the arguments (% + at) 
in (12) can lie outside the interval. 

Consequently, for the characteristics method to be applicable we 
have to continue the functions 6, (x) and 0, (x), or what is exactly 
equivalent, g(x) and 9, (x), outside the interval (0, l). From the physi- 
cal view-point, this continuation amounts to defining the initial dis- 
turbance of an infinite string such that the motion of its part (0, l) is 
the same as if it were fixed at the ends and the rest of the string re- 
moved. 

On substituting x = 0 and æ = l in the right-hand side of (12), the 
boundary conditions become, on equating the results to zero: 


ee alate a 
6, (l — at) + 0, (1 + at) =0, 


or, if we write the variable argument at simply as zx: 


6, (— 2) = — 0(z); | ai 
0, (l+ x) = — 0, (l — 2). 
When « varies in (0, l), the argument (l — xv) also varies in this interval, 
and the right-hand sides of equations (28) are known to us. But the 
arguments (—2) and (l -+ x) now vary respectively in the intervals 
(—}, 0) and (l, 21), and the second of equations (28) gives us values 
of 0, (x) in (l, 21), whilst the first gives @, (x) in (—l, 0). Furthermore, 
as œ varies in (l, 2l), the argument (l — x) varies in (—1, 0), and the 
right-hand sides of equations (28) are known to us on the basis of the 
above working. Arguments (—x) and (l+ x) now vary in the intervals 
(—2l, —l) and (21, 31), so that (28) gives us 0, (x) in (2l, 3l) and 0; (x) 
in (—21, —1). We can see by continuing in the same way that (28) 
gives us the determinate values for 0, (x) with x < 0 and @,(z) with 
x > l, which we require for applying (12) with t > 0. Similarly, if x 
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varies in the interval (—1, 0), the left-hand sides of equations (28) are 
known, and we obtain 0, (x) in (—l, 0) and 9, (z) in (l, 21). With x 
varying next in (—2l, —l), we get 6, (x) in (—2l, —l) and 0, (x) in 
(22, 31) and so on, i.e. equations (28) give us 0, (x) and 90, (x) defined 
for all real v. 

If we replace x by (l + 2) in the second of equations (28) and make 
use of the first equation, we get: 


0, (x + 21) = — 0, (— x) = 0; (£), 


i.e. the function 6, (x) is shown to have the period 2l. It now follows 
from the first of (28) that 6, (x) also has the period 2l. This implies 
that we in fact know 6, (x) and 0, (x) for all real x provided simply 
that we carry out the first of the operations described above for con- 
tinuing these functions, i.e. it is sufficient to let x vary in (0,2) only. 
Equations (28) give us 6, (x) in (—l, 0) and 0, (x) in (J, 2l), ie. 6, (x) 
is known in (—J, +1) whilst 6, (x) is known in (0, 2). The remaining 
values of the functions follow from their periodicity. 

Having thus defined 6, (x) and 0, (x), the functions p(x), p, (x) are 
readily continued, since we have by equations (26): 


p (2) = 6, (2) + 0a (2); =f pi (z) dz = 6, (2) — 0, (2), 
0 


Py (x) = a [0; (x) — 0; (2)]. 


We obtain on replacing x by (—2) in the first of equations (28) and on 
differentiating also: 


9, (2) = —O,(—a); 0) (— 2) = 63 (x); 6; (x) = 0, (— 2). 


We use these relationships and the first of equations (28) to enable us 
to write: 


p (— z) = 0, (— x) + 6, (— £) = — 6, (x) — 0; (x) = — 9 (2) 
pı (— 2) = a [803 (— x) — 8i (— x)| = a [0i (x) — 02 (2)] = — p, (2), 
i.e. we obtain an extremely simple rule for continuing g(x) and 9, (x): 
they are continued from the interval (0, l) to (—J, 0) oddly, and there- 
after periodically with period 2l. If, with this, we obtain functions 
g(x) and q, (x) along the entire x axis such that g(x) has continuous 
derivatives g'(x) and g”(x), whilst p, (x) has a continuous derivative 


gi (x), we shall have by (17) a twice continuously differentiable 
solution of our problem. 
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We return once more to the zt plane. Since the string is finite, 
we need only consider the strip of the upper half-plane ¢ > 0 included 
between the lines x = 0, x = 1 (Fig. 131). The physical significance 
of the solution (12), in which the functions 
6,(x) and @,(x) are defined for all v, as shown 
above, may be explained as follows. Having 
drawn characteristics through the points O 
and L to their intersections with the op- 
posite boundaries of the strip, we draw new 
characteristics through these points of inter- 
section to their further intersections with 
opposite boundaries of the strip, and so on. 

We divide the strip by this means into 
domains (I), (ID), (HI), ... Points of domain 
. (I) correspond to the points of the string at 
which only disturbances from interior points 
have arrived, and consequently the infinite 
parts that we imagine to have been added 
to the string have no effect here on the 
motion. At points in domain (II), wealready have disturbances coming 
from the imaginary parts; we shall take, for instance, the point 
My (£o to) of domain (II). 

Since 


Fre. 131 


u ( Xp, to) = O; (Zo — ato) + Oy (To + ato) , 


there are two waves present at this point: firstly the direct wave, from 
the initial disturbance at the point M, of the string with abscissa 
£ = Ta — alp and secondly the reverse wave from the point M, with 
abscissa x = Zo + aty, M, being in the present case a real point of the 
interval (0, l), and M, an imagined point. The latter is readily replaced 
by a real point by noting that, by (28): 


0z (Zo + ato) = b; (L + To + ato — l) = — 0, (21 — zo — ato) » 


so that the reverse wave 0, (£o + ato) is in fact the same as the direct 
wave —0, (2l — £ — atọ) from the point of initial disturbance 
M; (2l — £ — ato) (symmetrical to M, about L), which, after arriving 
at the end Z at the instant 


l — (21 — zo — ato) __ Zo +ato—l 
a _ a a 


t= 
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has reversed its direction and sign and arrived in this form at the instant 
to at Mo; in other words, fixing the end x = l has led to a reflection of 
the displacement wave, corresponding to a change in sign of the displace- 
ment whilst the absolute value is retained. 

We find the same phenomenon with waves arriving at the end 
x = 0; we have two waves at points of domain (III): the reverse and 
the direct, reflected from the end z = 0. We obtain at points indomains 
(IV), (V), (VI), ... waves which have undergone several of these reflec- 
tions at both ends of the string. 

If instead of a boundary condition (25) we had, say, at the end x = l, 
the conditionf: 


ou —_ 
Ox xt” 
we should have, instead of the second of equations (27): 
6; (l — at) +6; (1+ at) =0, 
or, on again replacing at by gx: 
0 (l + x)= — 0 (l — z). 
Integration of this relationship clearly gives us: 
6, (1+ 2) =0,(l—2)+C, 


where C is a constant which we can take as zero without loss of 
generality, the proof of this being left to the reader. We thus have 


0a (l+ 2) = 0, (l — 2). (29) 


The physical interpretation of this condition is likewise reflection 
at the end x = l, though both the sign and magnitude of the displacement 
are now preserved. 


A remarkably simple example of the use of the above method of charac- 
teristics and reflections is given by a “plucked string”, which is stretched out 
at one point at the initial instant with zero initial velocity. The reader may 
easily verify the method given below for finding the shape of the string at any 
instant t, the initial shape being assumed given. 


+ This is met with in the theory of longitudinal vibrations of rods, which 
obey the same differential equation (5) or (6) with different physical values of 
the constant a. The condition imposed means that the end of the rod is free. 
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OAL gives the initial shape of the string in Fig. 132, the dotted line being 
the corresponding symmetrical figure with respect to the mid-point of the 
string x = l/2. Let the perpendicular AP to OL be continued to its intersection 


B’ with A’L. Let C be the mid- 
point of AB’, which gives us 
the direction LC. 

The shape of the string at any 
given instant is now obtained 
by moving a line of intersec- 
tion parallel to LO from the 
point A to the point A’; in 
particular, at the instant t= lja, 
the string takes the dotted 
polygonal shape OA’L. 

Figure 133 illustrates the successive forms of the string 
at the instants: 


Fia. 132 


eae Ze Seog 
E Mace) Neem” Waa 2 


167. Fourier’s method. The transverse vibra- 
tions of a string fixed at its ends may also be 
treated by means of Fourier series. Although this 
method is not as simple as the above in this par- 
ticular case, it is worth describing because it can 
be used in many other problems for which the 


method of characteristics is inapplicable. We again 


equations of the problem, in a different order: 


Bu  , Pu 
a Oa? ? 
trap = 0, ulsa =0, 
ðu 
w|_ j= 92) > OF leo pı (x) 


Fie. 133 


write down the 


(30) 
(31) 
(32) 


Instead of seeking the general solution of equation (30), we seek a 
particular solution as the product of two functions, one of which depends 


only on t and the other only on x: 
u =T (t) X (x). 
We have on substituting this in (30): 
X (x) T” (t) =a? T (t) X” (x) 
or 


T) _ X” (2) 
aT) X(«) 


(33) 
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The function on the left of this equation depends only on ¢, and that 
on the right only on g; equality is only possible if both left and right 
are independent of both ¢ and v, i.e. both sides represent the same 
constant. 

We write this constant as (—k?): 

T” (t X” (x 
a z TG eer ASE) 


hence we get two equations: 
X” (a) + KX (a) = 0; T” (t) + æT (t) = 0. (35) 
The general solutions of these equations are, with k + 0 [27]: 
X (x) = C cos kx + D sin kz; T (t) = Acosakt + Bsinakt, (35,) 
where A, B, C, D are arbitrary constants. 
We obtain for u, from (33): 
u = (A cosakt + B sin akt) (C cos kx + D sin kz). (36) 


We now choose the constants so that boundary conditions (31) are 
satisfied, i.e. the factor in x of (36) vanishes for z = 0 and v =l. 
This gives: 


C-14+ D-0=0; Ccoskl + D sin ki = 0. 


It follows from the first equation that O = 0, and the second gives 
D sin kl = 0. If we take D=0, we have C= D = 0 and (36) is 
identically zero. This is a trivial solution, so that we must take 
D #0 but sin kl = 0. 

We thus get an equation giving the parameter k, which has so far 
remained entirely arbitrary: 


sin kl = 0, 


2 
katt, 45,04, £7... (37) 


If we substitute k = na/l or k = —nz/l in (36), the only difference 
is in the sign of the sines, and in view of the presence of the arbitrary 


+ If we had written (+4?) for the constant in (34) instead of (—k?), we should 
have obtained X(x) = Ce + De~™ and it would have been quite impossible 
to satisfy boundary conditions (31). 

The same situation is obtained with k = 0. Similar remarks apply to later 
problems for which we use Fourier’s method. 
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constant factors the two solutions will be essentially the same. Hence 
it is sufficient to take only positive k from values (37). On putting 
C = 0 in (36) and writing A and B for the arbitrary constants AD 
and BD, we get: 


u = (A cos akt + B sin akt) sin kz. 


We still have to substitute one of the values (37) for k. We can take 
different values for A and B on substituting different values of k. 
This leads us to an infinite set of solutions: 


Un = (4, cos + By sin) sin" (n = 1,2,...). (38) 


These solutions satisfy both equation (30) and boundary conditions 
(31). We now notice that, due to the fact that (30) and (31) are linear 
and homogeneous equations, if %4, ug, ... are solutions satisfying these, 
their sum likewise satisfies the equations (as in the analogous case of 
ordinary linear homogeneous equations). Hence we have the following 
solution: 


u = 2 (4, cos 27 + Bn sin 27”) sin Z, (39) 


It remains to choose A, and B, so as to satisfy also the initial con- 
ditions (32). We differentiate solution (39) with respect to ¢: 


Ou $ nna ._ nnat naat) . nag 
= (T Asin" + Bac cos") sin. (40) 


On setting ¢ == 0 in (39) and (40), we get by (32): 
g (2) = > Ansin TS, pil) = Br sin E, (41) 
n= n= 
The series written represent sine expansions of the given functions 
g(x) and g(x) in the interval (0,1). The coefficients of these ex- 
pansions are defined by the familiar formulae of [146], which leads 
us to the following values for A, and B,: 


=o sin dz Br = 2 [ne sin“ dz. (42) 


On substituting these values in (39), we obtain a series which for- 
mally meetsallour requirements. We give below the sufficient con- 
ditions to be imposed on g(x) and p; (x) so that the sum of the series 
in fact provides the solution of our problem. 
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168. Harmonics and standing waves. We bring in the amplitude N, 
and initial phase gy, of a harmonic: 


An cos = +B, sin = N,, sin Ga + Pn): 


Each term of series (39) gives a solution of the problem: 


(4n cos 7e + B, sin =) sin = = N, sin Ce + Pn) sin" (43) 
and represents a so-called standing wave, such that points of the string 
carry out harmonic vibrations of the same phase and with the ampli- 


tude 


« UNL 
N,, sin T 


depending on the position of the point. The string emits a sound with 
this type of vibration; the pitch depends on the frequency of the 
vibration: 


whilst the strength depends on the maximum amplitude of the vibra- 
tion Nn. On assigning the values 1, 2,3, .. . to n, we get the fundamental 
tone of the string and a series of successive overtones, the frequencies of 
which or numbers of vibrations per second are proportional to the 
terms of the natural series of integers 1, 2, 3, ... The amplitude Npn 
sin nzgjl can be negative for certain values. Its absolute value can be 
taken on adding x to the phase. 

Solution (39), i.e. the emitted sound, is the addition of these separate 
tones or harmonics; their amplitudes, and therefore their influence on 
the sound, generally diminishes rapidly as the number of the harmonic 
increases, whilst the total result of their effects produces the timbre 
of the sound, which differs with different musical instruments and is 
due to the presence of the overtones. 


At the points 
PEN |e Oc cele N A (45) 
n? n n 

the amplitude of the nth harmonic vanishes, since sin nzz/l = 0 at 
these points, which are called nodes of the nth harmonic. On the other 


hand, at 


dae he 3 e r CONE (45,) 
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the amplitude of the nth harmonic has maxima, since sin nzz/l attains 
its maximum absolute value at these points, which are known as anti- 
nodes of the nth harmonic. With this, the string vibrates as though 
it were composed of n mutually unconnected separate pieces, each 
fixed at its boundary nodes. If we press the string exactly at the mid- 
point, i.e. at the antinode of its fundamental tone, we obtain vanishing 
not only of this tone but of all the others that have antinodes at this 
point, ie. the 3rd, 5th, ... harmonics; whereas the even harmonics, 
which have nodes at this point, are unaffected, so that the string now 
emits the octave instead of the fundamental, i.e. the note with twice 
the number of vibrations per second. 

The above method might be distinguished from that of character- 
istics by calling it the standing wave method ; but it is generally known 
as Fourter’s method. 

We can readily show that the solution given by series (39) is com- 
pletely identical with that obtained above in [166]. We start by re- 
calling the proof in [166] that the use of d’Alembert’s formula (16) 
in the case of a finite string requires the odd continuation in the 
interval (—J, 0), and thereafter continuation with period 2l, of the 
functions g(x) and p, (x) specified in the interval (0, 1). But this method 
of continuation is exactly equivalent to the sine expansion of these 
functions [145], i.e. is exactly equivalent to expressions (41) for any x. 
On substituting these expressions for g(x) and p; (x) in d’Alembert’s 
formula (17), we in fact arrive at solution (39), as may easily be seen: 

es 15 A, [sin na ez at) + sin nn e+ at) | mn 


n=) 


or 


ı g f = f 
w=5 5 An [sin nn e at) Jeina at at) ] + 


1 = 
+ 22 B, [cos nn e at) cos 7 et at) Ji 
whence (39) follows at once. 
In the present case, Fourier’s method has the disadvantage compared 
with the characteristics method that series (39) is often very slowly 
convergent and is unsuitable both for computation and even for the 
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rigorous proof of the fact that the series is in fact the solution, since 
this latter involves its differentiation twice term by term, which brings 
in a factor of n? before the nth term. The relationship between the 
required function and the originally assigned (x), y,(x), expressible 
by series (39), is much more complicated outwardly than the relation- 
ship given by the characteristics method. Nevertheless, Fourier’s 
method reveals the extremely important fact that the string has an 
infinite set of individual harmonics, addition of which gives the total 
vibration. 

On taking into account the discussion of [166], we can say that the 
sum of series (39) will give the solution of our problem with continuous 
derivatives up to the second order, if g(x) and p, (x) possess the pro- 
perties stated in [166]. If g(x) has continuous derivatives up to the 
third order and satisfies (0) = 9” (0) = y(t) = g” (D) = 0, whilst 
pı (x) has continuous derivatives to the second order and p, (0) = 
= p (l) = 0, series (39) can be shown to be twice differentiable with 
respect to x and ¢. It is also possible to consider solutions of the wave 
equation with fewer assumptions regarding the initial data, and we 
shall discuss this in Volume IV. In future applications of Fourier’s 
method, we shall not stipulate the conditions under which the series 
obtained in fact represent solutions of our problems. The general aspects 
of Fourier’s method are dealt with in Volume IV. The present aim 
is to indicate the method of solution and the results obtainable. 
A further point: it follows at once from the arguments of [164] and 
the characteristics method of [166] that the above solutions are 
unique, both for an infinite and a finite string. The question of the 
uniqueness of the solutions of the general wave equation is discussed 
below. 


169. Forced vibrations. We deduced in [163] the equation of the 
forced vibrations of a string acted on by a force F(x, t) per unit length: 


Gane Ete feo-tres). w 


Boundary and initial conditions must be associated with this 
equation (taking the case of a string fixed at the ends): 


U\xno = 0; Ufy = 0, (47) 


ulo = pla); S| = 91 (2). (48) 
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These forced vibrations of a general type can be represented as the 
result of adding two vibratory motions; one of these is purely forced 
and due to the action of the force F, the string being assumed still 
in the state of rest at the initial instant, whereas the other is the free 
vibration accomplished by the string simply as a result of the initial 
disturbance without the force being present. This amounts analyti- 
cally to replacing u by two new functions v and w, in accordance with 
the formula: 

u =v +w, 


where v satisfies the conditions: 


0? > 8? 
ar HC ae thle i), (49) 
v lem =0; v an = 0, f (50) 
Ov 
v lt<0 = 0; Oo t<0 = 0. (51) 


and provides the purely forced vibration, whilst w satisfies the con- 
ditions: 


ew Ow 
= a2 
at? ss Ox? ’ 
w|,-9 = 0, w |z- = 9, 


Ow 


w | no =P (2), -r pı (x) 


i=0 


and gives the free vibration. We may readily verify, on forming the 
sum u = v + w, that this provides the solution of our problem, i.e. 
of equations (46), (47), (48). 

The methods of finding the free vibrations w have been considered 
in previous sections,so that weshall deal here only with how to find v. 
As in the case of free vibrations, we seek v as a series: 


v(x, t) = XTn (t) sin ==, (52) 
so that boundary conditions (50) are automatically satisfied, whilst 
the Tn (t) are obviously different from the functions that we had in 
[167], since equation (49) is not homogeneous. 
We obtain on substituting series (52) in equation (49): 
YT Osn = —a? ST, (t) (+) sin" + f (x,t), 
n=l 


n=! l 
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whence, on replacing anz/l by the quantity œn (44) [168]: 
fo d= > [ral )+ w2 7, ( t)| sin“. (53) 


We can expand f(x, t), considered as a function of æ, in a Fourier 
series in the interval 0 < g < l: 


= > fn (6) sin, (54) 


n=1 
the coefficients f,» (t) of this series being dependent on t and given by 
l 


fa (t) = F f (8 t) sin“ de, (55) 
0 


On comparing expansions (53) and (54) for the same function f(x, t), 
we obtain the series of ie 
TRO HoT =fr (n=1,2,...,), (56) 


defining the functions T, T, (t), . 

Having thus defined the functions T, (t), function (52) now satisfies 
differential equation (49) and boundary conditions (50). In order to 
satisfy also the remaining initial conditions (51), we merely have to 
subject the T, (t) to these conditions, i.e. we put 


T,, (0) =0, T%(0) =0, (57) 
since it is then clear that 


On = O 
o| = 57,0) )sin = = 0; Fo at ) sin F =0, 


n=l 
The solution of equations (56) and (57) was indicated in [28], whence 
we readily deduce that: 
1 j F 
T, => f fa (1) sin wp (t — T) dr, 
One 
or, on substituting expression (55) for fn (7) 


t 1 
Th () = 7 fart ee, T) sin w, (t — t) sin = dz. (58) 
ò ò 


Substitution of this in (52) gives us the expression for v(x, t). It is 
easily shown that if f(x, t) has continuous derivatives up to the second 
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order and /(0,t) = f(l, t) = 0, the sum of series (52) is in fact the 
solution of problem (49)—(51). 

We have so far considered non-homogeneity either in the initial 
conditions (with function w) or in the differential equation (with 
function v). It is natural to consider also non-homogeneous boundary 
conditions. If we assume the equation and the initial conditions homo- 
geneous and again let u denote the required function, we get the 
following problem: 


Bu Ou 


1 = g _ Ou 
oe Cae u l0 7 2 H; uj 50 (t); ul =F 


t=0 


We shall discuss this case of non-homogeneous boundary conditions 
in Volume IV. 


170. Concentrated force. We investigate (58) in the case of a concentrated 
force at the single point C (x = c). We shall denote the magnitude of the force 
by oP instead of by P as in [163]. As has been shown [163], this caso can be 
considered as the limit of the case when the force F acts only on a small interval 
(c — 6,c + ô), F being zero outside this interval, whilst its total magnitude 


c+6 
f F(z, t)dz > oP (t) as 6+0. 
fo] 


E= 


We have from the second of equations (4): 


c+ 
f 1de > P (0) as 630. 
2 


If we note that f(z, t) is zero by hypothesis outside the interval c — ô< 
< z < c + ô and use the first mean value theorem fI, 95] with the assumption 
that f(z, t) has an invariable sign in 


c-d<z2<c+0, 


we get: 

l c+ò c+é 

[ #(2.0) sin a dz = | te t) sin 22 dz = sin nme [ /@,0 a, 

6 c—d c>ô 
where ¢ is a point of (c — 6, c + ô). 

In the limit, as 6 — 0: 
l 
nr nrc 


f(z, t) sin 


5 dz —> P (t) sin 
0 


1 ’ 
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and the function T(t), defined as the limit of the right-hand side of (58) as 
ô — 0, now becomes 


2 n: 


t 
Tali) = sin fe (t) sin ©, (t — T) dt, 
0 


lon l 
whilst the forced vibration is given by tbe expression: 


nac nar 


t 
v(m) = Sgin i | P (a) sin wn (t — 2) dr -sin 
n=l 0 


f (59) 


This formula shows that certain overtones can be absent in forced vibrations, 
these being the overtones for which 


. NTC 
sin 7 = 0, 


i.e. which have a node at the point C of application of the force. 
We shall dwell on the case when the impressed force is of a harmonic csel- 
latory type and we have to put 


P (t) = Py sin (wt + po), 
or, if we take the phase p, = 0 for simplicity: 
P (t) = Psin wt. 
The expression for T(t) now gives: 
t 


Ta (t) = ia sin = fe sin wr sin w, (t — t) dt = 
P t 
. NNC ) 
=— T a I feos [ont — (wn — w) t ] — cos [wnt — (wn + o) t]} dr = 
0 
— 2w Po . ANC 2Po 


; . nne . 
= ja, (wo? — a) sin —— sin Ont + Tas =o T sin wt. 

If the frequency of the impressed force is not coincident with any of the 
proper frequencies wp, none of the denominators (w3 — w?) vanishes; where as 
if œ approaches one of the w,, the corresponding denominator decreases and 
the particular T„(t) becomes very large compared with the rest, i.e. the pheno- 
menon of resonance takes place. Finally, if œ = œp, the above expression for 
T(t) becomes meaningless and has to be replaced by another. 

Substitution of the expression obtained for the 77,(é) in (52) gives us: 


sin E 
n 
— 2 P, <, | I ; o NTE , 
v(x, t) = I >> or of ot Sent sia — + 
al = f. 
n=1 "On 
. nne 
eo SIn I nax 
+ © sin wt in — 
= 2 — w? I 
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The first term on the right has the form of the proper vibrations, whilst the 
second has the same frequency as the disturbing force. We shall neglect the first 
term with free vibrations w(x, t) and only concern ourselves with the second, 
writing it as V(x, t): 

nac 


y ix D = 2E si ree Lin 
(x, t) = g Sino PETI T> 


or, on setting a? = w? lja? në: 


ain NT 
ine 
2P ae l nna 
V (x,t) = qi ge Sint > aa 8B (60) 
n=} 
The sum: 
. nne 
œo sin 
l sin naL 
PA n? — a? l 
n=) 


can be evaluated by the method indicated in [159]. Instead of proceeding along 
these lines, however, we shal] approach the problem in a new way by considering 
the concentrated force directly instead of as the limiting case of a continuously 
distributed force. 

The point C of application of the force divides the string into two parts 
(0, c) and (c, l). We consider these two parts individually, denoting the ordinate 
of the first part by u,(x, t) and the ordinate of the second by u(x, t). We get the 
following equations for the two functions u, and uz: 


2 2 
Te =w Tr fons a 8) (61) 

2 2. 
Ea =a? ous forc<a<l, 61,) 


since there are no external forces inside the intervals (0, c) and (c, l). Further, 
we have the conditions for the fixed ends: 


Uy Ino =O, Us [rms = 0, (62) 

hen the condition for continuity of the string at ~ = c: 
Uy xec = Ya Ieee (63) 
and finally, the condition for equilibrium of the forces acting at a = c [163]: 


Iua 
Oa 


_ ðu, 
Ix=c Ox 


_. 8 1 
=-7 alia eal (64) 


x=e 
We confine ourselves to the case of a harmonic force 
P (t) = Py sin wt 


t With our present notation, we have to replace P in equation (7) [163] 
by ePi(t), and (0u/dxr),, (Ou/Ox). by 0u,/dx, Ou,/Oxr. 
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and distinguish in the vibrations produced those of the same frequency w 
We shall look for these vibrations in the form: 


u (x, t) = X (x) sin wt, 


where, however, we must have different expressions for the function X(x) 
in the intervals (0, c) and (c, l); as a result of this, we put: 


u, = X, (a) sin mt, U, = X, (x) sin wt. (65) 
On substituting in equations (61) and (61,), we have: 


— œ? gin wt X, (x) = a* Xj (x) sin wt, 
whence 


Xi (2) + a a (z) = 


and similarly, 
X; (0) +45 X, (0) = 
We thus obtain, using [27]: 
X, (xz) = 0; 1 cos — +O; sin — — 2; X, (x) = Oj cos s + Oj sin -© a. 
Conditions (62) give us: 
, P wl „n ol 
C, =0, OF cos —— + Cz sin- =0, 
whence it follows that we can put 
wl 


. œl 
Ci =C,8in—, Cz = — 0, cos —, 


where C, is an arbitrary constant. On denoting the arbitrary constant C, by 
C, for the sake of symmetry, we get: 


x) 


Pics 
X, (2) =O, sin 2", x, (z) =C, sin os - 


The continuity condition (63) now gives: 


w (I — c) 
a 


, oC. r : 
C, sin z sinet = Osin sin wt. 


It only remains to satisfy the last condition (64), from which we obtain 


w w (l —c) . w oc, Po 
— L o, cos CZ. ain oot — — 0, cos — sin wt = — —} sin ot. 
a a a a a 


Hence constants C, and C, are determined by the system of equations : 


wc o(l—c) _ Py 
=0; 0, cos -z> +0, cos a = ’ 


œ (l —c) 
aor a 


. 0c F 
0, sin s — O, sin 
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whence we obtain by simple working: 
én? (l—c) . oc 


_ P, a O = Eo. 
aw . ol 


the final solution of the problem being given by expressions (65) in the form: 


( . a(l—c) 
p ®— z ioe 
—? _____"___ sin —sinwt for 0< 2 <e, 
aw . wl a 
sin a 
u (x, t) = (66) 
P om n w (l — x) 
—? —__—.. sin ON inat for c<a<l. 
aw wl a 
a 


The reader may casily verify that solutions (66) and (60) for V(x, t) are 
identical, by expanding (66) as a Fourier sine series. 


171. Poisson’s formula. By analogy with an infinite string, we now 
consider the solution of the general wave equation: 


Pu u e) (67) 


ae TA (ar + oye Toe 
in unbounded space with given initial conditions. We start by deducing 
an auxiliary proposition. We shall find it more convenient to write 
the coordinates (x, Y, 2) a8 (Ti, Lp, %3). Let w(x, Xz, %3) be any contin- 
uous function with continuous derivatives up to the second order 
in a domain D or throughout space. All future arguments will be in 
reference to this domain. We consider the values of the function w 
on the surface O, (£i, £2 73) of a sphere with centre at the point 
(£i; Xo £3) and radius r. The coordinates of points of the sphere can be 
written as: 


& = 2, + ar; $, = t, + ar; & = £a + agr, 


where (a,, a» @3) are the direction-cosines of radii of the sphere. We 
can write these latter in the form: 


a, = sin 0 cos g; a, = sinOsing; a, = cos 0, 


where the angle 0 varies from 0 to x and the angle from 0 to 2x. 
We let do denote an elementary area of the unit sphere and d,o 
an elementary area of the sphere of radius r: 


d,o = sin 0 d dq; d,o = r? d;o = 7? sin 6 dé dg. 
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We consider the arithmetic mean of the values of the function œw over 
the spherical surface C, (x, 2, %3), ie. the integral of w(x, 2, £4) over 
the spherical surface divided by the surface-area. The value of the 
integral here obviously depends on the choice of the centre (2, 2», 23) 
and the radius r of the sphere, and the arithmetic mean will be a 
function of the four variables (x, 2, 73,7). We can write the arith- 
metic mean either as 


2n n 
1 
V (Lis Ly, Xz, T) = zl fo (a + ar; £a + agr; £ + asr) do, (68) 
00 
or as 
l 
V (£i, Ta Tg, T) = m] fo (xı + ar; % + Agr; Hs + agr) d,o. 
Cr 


We shall prove that, for any choice of function œ, the function v 
satisfies the same partial differential equation, viz: 
02 2 
ar wti, (69) 
where as usual, 
3w gv ar 
Av = a9 + aa + Gad’ 


The integration in (68) is carried out over the surface of the unit 
sphere, and we can differentiate with respect to 2; under the integral 
sign. We thus have: 


m 
1 
Av =z] f Aw (x; + ar) dio 
and 
3 1 2an 3 ə 
v K w 
or = Tr l l 2 Ox, Ak d;o. 


We can transform the last integral to a surface integral over the sphere 
O, (a4, To, Xs): 


Ov 1 : Ow 
Or = “Arr? JPA Ox, Qk d,o , 
Cr 
and application of Ostrogradskii’s formula gives us 


E- ahr [fjeae m 
D, 
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where D, is the sphere with centre (ti, £o %3) and radius r. The last 
expression is the product of two functions of r: the fraction 1/427? 
and the integral. The derivative with respect to r of the triple integral 
over the sphere D, is equal to the integral of the same integrand over 
the surface C, of this sphere. To see this, we need only say write the 
integral over D, in spherical coordinates. Hence a second differentiation 
with respect to r gives us: 


f= gi [[faodv + 4, f fiwa. 


D, C, 


If we substitute the above expressions for the different derivatives 
in equation (69), we see once that the equation is in fact satisfied. 
If r— 0, it immediately follows from (68) that v(x, £, £) tends to 
(21, Xo L3), Whilst it follows from (70) that dv/dr tends to zero, since 
by the mean value theorem, the triple integral in (70) is of the order 7°, 
whilst we have 7? in the denominator. We thus arrive at the following 
theorem: 

THEOREM. For any choice of a function w having continuous deriva- 
tives up to the second order, the function v defined by equation (68) satis- 
fies equation (69) and the initial conditions: 

dv 


Oh aia (£i, Zo, Lg); Er pa TO (71) 


We use this theorem to prove that the function 
u (x, Hg, X3, t) = tv (£i, Xe, T3: at) (72) 


satisfies the wave equation 


u _ (0u u Ou 
Je = a +a T oat) (73) 
and the initial conditions 
Ou 
u F =0; 5 ne (Lrs Zo, Ta). (74) 
We have in fact: 
Ou ; Ov (21, Tg Lg, At) 
a = V(X. To. Tg at) + at lr Ke Fe Ot) , 
Gu Ov (21, Las Ly. at) op O70 (Li, Los To At) ` 
gr T 4a ar Tat a i 


Au = t Av (Xi, £o 23, at), 
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where e.g. 00(2,, ®z Xg, at)/Ər is the value of the derivative 9v(2x,, La Ty, 
r)/Ər at r = at. On substituting the above expressions in equation 
(73), we obtain equation (69) for v with r = at, which is in fact valid 
as shown above. Initial conditions (74) are obtained immediately from 
(71). Inasmuch as (73) is a linear homogeneous equation with constant 
coefficients, we can say that the function u, = du/dt also satisfies the 
equation. We find the initial conditions for this function with ¢ = 0. 
Initial conditions (74) give us at once: 


Uy [imo = © (T3, Hy, 3). 
We have for the derivative 3u,/3t = 0?u/At?, by (73): 


Ou, 
Ot 


eu ou Ou 
— a2 
Ne a: (ae t gg t aF) 


? 


t=0 


whence we find on differentiating the first of initial conditions (74) 
with respect to the coordinates: 


Ou, 
ot 


t=0 


Thus the derivative with respect to t of the above solution satis- 
fying initial conditions (74) of wave equation (73) is likewise a solution 
of the wave equation and satisfies the initial conditions: 


ð 
= Ww (Xi, Xa, Xa) ; Ea =0. (74,) 


u = 
eo t=0 


If, on returning to the previous coordinate notation, we take some 
function 9, (x, y, 2) for (2, y, 2) in the first case of initial conditions 
(74), and some other function g(x, y, z) for w(x, y, z) in the second case 
of initial conditions (74,), and add the solutions thus found, we arrive 
at a solution of equation (67) satisfying the initial conditions: 


. ðu = 
ujo ZPE) Belg M1 (MY 2)- (75) 


If we write briefly T,{@(M)} for the arithmetic mean of the function 
w over the sphere with centre M(x, y, z) and radius r, by the above 
arguments, we can write the solution of equation (67) satisfying initial 
conditions (75) in the form: — 


u (M, t) = tT a {p1 ()} + [tT o {P 0]. (76) 
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This equation is generally known as Poisson’s formula. It can evi- 
dently be written in the form 


2n n 2n n 
u (x, y. 2, t) = ff Pı (a, B, y) d;o + E for, B, y) ao} (76,) 
00 00 


where d,o = sin 6 d6 dọ and (a, ĝ, y) are the coordinates of a variable 
point on the sphere: 


a=a+atsinOdcosy, B=y+atsinésing; p=2z-+atcos@. (77) 


The above arguments show that the function u defined by formula 
(76) in fact satisfies equation (67) and conditions (75) if (2, y, z) has 
continuous derivatives up to the second order and g(x, y, z) up to the 
third order. The last statement is bound up with the fact that the 
second term in (76) contains a differentiation with respect to t. 

If p(z, y, z) and p(x, y, z) have less satisfactory differential pro- 
perties, as happens, for instance, in problems with concentrated initial 
disturbances, it seems natural to suppose that formula (76,) still gives 
the solution. But in this case, the solution is in fact generalized and 
not classical (see Vol. IV). 

We shall see later that the problem treated can only haveone solution. 

Let us suppose that the initial disturbance is concentrated in a 
bounded volume (v) with surface (ø), i.e. p(N) and p, (N) are zero 
outside (v), and let the point M be situated outside (v). With t < dja, 
where d is the shortest distance from M to (ø), the sphere (Sar) is 
situated outside (v), both the above functions are zero on (Sa), and 
(76) gives u(M, t) = 0, i.e. rest at the point M. At the instant t = dja, 
the surface (Sar) touches (o) and the forward wave-front passes through 
M. Finally, with t > Dfa, where D is the greatest distance from M 
to a point of (øo), the sphere (Sa) will again be situated outside (v) 
[volume (v) will be entirely inside (Sa) ], and (76) again gives u( M, t) = 
= 0. The instant t = Dja corresponds to the passage of the rear wave- 
front through M, after which u(M, t) vanishes at this point instead 
of becoming a constant as in the case of a string (i.e. for a plane wave): 
The forward wave-front at a given instant ¢ consists of the surface 
which separates points already vibrating from points which have not 
yet begun to vibrate. It follows from the above that every point of 
this surface has a shortest distance at from (ø). The surface is readily 
shown to be the envelope of the family of spheres of radius at with 
centres on the surface (a). As we shall see, the constant a is the speed 
of propagation of the wave-front. 
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172. Cylindrical waves. We refer space to Cartesian coordinates and 
assume that functions g(x, y, 2) and ¢,(z, y, z) depend only on 2 and y, 
i.e. they preserve a constant value along any straight line parallel to 
axis OZ. If the point M(x, y, z) is displaced parallel to OZ, it is clear 
that the right-hand side of (76,) remains unchanged, i.e. the function 
u(x, y, z, t) is also independent of z, and (76,) gives us the solution of 
the equation: 


au Ou u 
EA (ar + ar) (78) 


with the initial conditions: 


eg TPO Ge hg = MH): (79) 


We can consider this solution whilst remaining exclusively on the zy 
plane. For this, we have to transform the integral over a sphere in 
(76,) to an integral over a circle in the zy plane. We take a point 
M(x, y) on the zy plane. The points with coordinates (a, 8, y) defined 
in accordance with (77) with z = 0 are variable points of the sphere 
(Sat) with centre M(x, y, 0) and radius at. An elementary surface area 
of this sphere is given by dS. = a? djo. Parts of the sphere situ- 
ated above and below the zy plane project on to this plane as circles 
(Ca) with centre M and radius at. The elementary area of the pro- 
jection dC, is connected with the elementary surface area of the 
sphere dS,, by the formula [62]: 


dC a 
cos (n, Z) ’ 


dSa = 


where n is the direction of the normal to (Sat), i.e. the radius of the 
sphere, forming an acute angle with the z axis. If N is a variable point 
of the sphere and JN, its projection on the zy plane, it is clear from ele- 
mentary geometry that 


NN, _ Yau? —(a— a) Py 
MN at i 


cos (n, Z) = 


where (a, 8) are the coordinates of a variable point of the circle (Cat). 
On making all these substitutions in the first integral of (76,) and 
noticing that circle (Car) is obtained both from the upper and the 
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lower parts of sphere (Sat), we get the following transformation for the 
first integral: 
2n n 


iffa (a, B, Y) do = az | fo (a, B) dS, = 
0 0 (Sot) 


at, 


= 1 > gı (2, P) 
~ Bra | | Var? — (a — a)? — (B — y} AC 


On applying the same transformation to the second integral and 
writing the elementary area dC, on the xy plane in the form da df, 
we finally get the following formula for the function satisfying equation 
(78) and conditions (79): 


aal pı (a, P) da dB 
u (x,y, t) = 2na i Yat? — (a — x)? — (a — y)? 


að 1 ¢ (a, £) da dB 
+ 2na l! Var? — (a — x)? — (B— y} | (80) 

Let the initial disturbance be confined to a finite domain (B) of the 
xy plane with contour (I), ie. p(x, y) and p(x, y) are zero outside 
(B). For instants t < dja, where d is the shortest distance from M 
to contour (l), the circle (Ca) has no common points with (B), p(x, y) 
and q, (x, y) are zero throughout (Cat), and (80) gives u(x, y, t) = 0. 
At the instant t = dja the forward wave-front reaches M. For t > 
> Dja, where D is the greatest distance from M to contour (l), domain 
(B) lies wholly inside the circle (Cat) and the integration in (80) must 
be carried out simply over (B) since g(x, y) and ¢, (x, y) vanish 
outside (B), i.e. 


l f f g, (a, £) da df 
ana J J yae — (a — a)? — (B— y} 


(B) 
ə l ¢ (a, B) da d 
+ ot 2na J Var? = (a — a)? — (f — y} | 


u (x,y, t) = 


In the present case, the function u(x, y, t) does not vanish after 
passage of the rear wave-front at the instant t = Dja, as in the case 
of three-dimensional space, nor does it become constant as in the 
case of a string. But in view of the presence of a? t in the denominator 
we can still affirm that u(x, y, t) tends to zero on indefinite increase of t. 

The phenomenon occurring here is known as wave diffusion after 
passage of the rear front. All our arguments have been carried through 
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whilst remaining on the zy plane. In three-dimensional space, equation 
(7%) corresponds to so-called cylindrical waves. 


173. The case of n-dimensional space. The results obtained in [171] can be 
generalized immediately for the case of any number of dimensions. Let us con- 
sider n-dimensional space with coordinates (Xi, 2%, ...,%,). The volume of a 
sphere of radius 7 in such space is given by the formulse [99]: 


gn 
, (27)? a 
vn (r) 2-4-6...(m—2)n 7 (n even), 
1 1 
oq (t) = fe dd) 
n E O Oe a 


On differentiating these expressions with respect to r, we get the surface 
area of the sphere: 


gn 
_ (2x)? n—1 P 
a(r) = 2.4-6... 0—2) Gs even), 
gar) ge) 
At 
Oiler ase ma (m odd), 


The direction-cosines a, of radii of the sphere are given in terms of (n — 1) 
angles by the formulae: 


a, = cos b, 
a, = sin 6, cos 6, 
a, = sin 8, sin 6, cos 6, 


On-2 = SiN 6, sin 6, ... sin Op_3 COS Bp» 
an-ı = Sin 6, sin 6, ... sin Op- COS Q 
a, = Sin 6, sin 8, ... sin @,_, sin Q, 


where 
O<O <a; O< p< 2x. 


An elementary surface area of the unit sphere becomes 
do = sin"~? 6, sin”™? 6, ... sin 9, ,d0,d6,... d0,_,dy, 
and of a sphere of radius r: 
dyo = r! do. 


Let w be a function with continuous derivatives up to the second order. 
given in the space Rp. Its arithmetic mean over a sphere of radius r and centre 
at (£p ...,%,) is given by: ae S 


-V (Ei Ea vey Em T) = 


On wl fo + ar, £y + azr, ...,2,_ + anr) do 
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or 


1 
On (r) 


|- fotar, La + azt, ..-,Lq + anr) d,o. 


vln baist T= 


We can show, precisely as above, that the function v satisfies the differential 
equation: 


Ov n=l Ov 
cy t oe 
and the initial conditions: 
Ov 
v = (By 023 Xp)3 am =0. 
N w (ay n) OF luo 


The resulta obtained can be used to derive definitive formulae for the wave equa- 
tion with any number of independent variables. We shall simply state final 
results for the general case. 

The solution of the wave equation 


Oru Ou Ou 
Or 7 (ja +3 + at Ta teal (81) 


with initial conditions 


ðu 


u| =0; DE yg ZO OPIO eo n) 
has the form, with n odd: 
nza nos 
2 2 3? Rts 
U (Gy Emt) = 7 30a 23) y [Tako y} (82,) 
acy “> 
and with n even: 
U (Tis <. Lp t) = 
n-2 at n=2 
22 ə r 3 ? — 
= n-2 [r* $ T, {w (x,)}] dr, (82,) 


24 (n—2) &) yan ae 


where 7, {w(x;)} is the arithmetic mean of the function w(x, £z, ..., n) Over 
a sphere of radius ọ and centre at (£i, £s ...,%,). A sufficient requirement 
for verifying (82,) and (82,) is that w/(2,, 2, ...,%,) should have continuous 


derivatives up to order (n + 1)/2 in the case of n odd, and up to (n + 2)/2 
in the case of even n. 


174, Non-homogeneous wave equation. We take the non-homo- 
geneous wave equation: 


Ou 


at 
oe =e (r + 


u 
Oy? 


+ <4 + f(x,y, 2, t) (83) 
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in infinite space and look for a solution of it satisfying the zero initial 


conditions: 


i =: 0; Ou 


0 a leo (84) 


Addition of this solution to the solution of the homogeneous equation 
satisfying initial conditions (75) will give us a solution of equation 
(83) satisfying initial conditions (75). 

In order to solve the above problem, we bring in the solution of the 
homogeneous equation 


aw Ow Ow ew 
= ala + ar ar) (85) 
satisfying the initial conditions 
. dw — 
w t=r =a 0; “Ot ime =f (x, Y, Z, T), (86) 


t = t being taken as the initial instant instead of t = 0, where r is a 
parameter. The function w is given by Poisson’s formula, except that 
t has to be replaced here by (t — t), on account of our new initial 
instant. We thus have: 


w (x, Y, 2, t; T) = 


an n 
| [ f[z+ ma(t—2), y+a,a(t—t), z+ aa (t— t), T] d,o, 
0 0 
(87) 
where 
a, = sin ĝ cosg; a, = sin Ô sin p; a; = cos ô. (88) 


We notice that w depends on the parameter t as well as on the 
ordinary independent variables (x, y, z, t). We now define a function 
u(x, y, z, t) by the formula: 


t . 
u(x, Y, 2, t) = f w (z, Y, z, t; t)dr (89) 
ò 


and show that it satisfies non-homogeneous equation (83) and zero 
initial conditions (84). We have: ; 


st 


t 
Ou = ow (x, Y, 2, t; T) , 
a fo A A AA (90) 
0 


174] NON-HOMOGENEOUS WAVE EQUATION 516 


The term outside the integral is zero by the first of conditions (86). 
A second differentiation with respect to ¢ gives us: 


au Bw (x, Y, 2, t; T) Ow (x, Y, 2, t; T) 
out = [eagan at? dr + T ğ rat’ 


the term outside the integral being in this case equal to /(z, y, 2, t) 
by the second of conditions (86), i.e. 


t 
Ou Btw (2, Y, 2, t; T) 
a = [SAE w EST ar + f(x,y, 2, t). 


0 


Differentiation of expression (89) with respect to the coordinates 
simply requires differentiation of the integrand: 


t 
Au = J Aw (x, Y, 2, tt) dt. 


It follows at once from the last two expressions and equation (85) 
that u satisfies equation (83). Initial conditions (84) follow immedia- 
tely from (89) and (90) on taking account of the fact that the term 
outside the integral in (90) is zero as shown above. Expression (89) 
thus gives the solution of equation (83) with initial conditions (84). 
On substituting for w(x, y, z, t; t) in (89) in accordance with (87), 
we get: 

t 2n n 
1 
u (£, yY, 2, t) = 7 -ff [fle + aalt- 7), y + aa (t — t), 
00 


ò 
z + aga (t — T), 7] d,o] dr. 


This expression for u can be written in a new form. Instead of t, we 
introduce a new variable of integration: r = a(t — t). On carrying out 
the change of variables, we get: - 


u (x, Y, Z, t) = 


at 2n n 


= ga f f fiear, y + ar, z + ay, t — =) r? sin 0 dr d dg, 
000 
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or, on multiplying and dividing by r: 


u (£, Y, 2, t) = 
at 27 ps as 2) 
Sial Ji - r? sin ô dr d0 dọ. 


On taking account of expressions (88) for the a, and recalling the 
expression for an elementary volume in spherical coordinates, we see 
that the three quadratures in the above formula are equivalent to the 
triple integral over the sphere Dat with centre (x, y, z) and radius at. 
On bringing in the variable point 

=x + ar; n=y + ar; = z + ar, 
and noting that at + a2 + a3 = 1, we get: 
r=V(E-2P? + (1 — y}? + (È — 2), 


and the expression for u finally takes the form: 


w (e921) = gar] | [———** 


rat 


i oe ~a) dv, (91) 


where the inequality r < at characterizes the sphere Dat mentioned 
above. The characteristic feature of the integrand in the last expression 
is that the function f runs from the instant t — (rja) which precedes the 
instant ¢ for which u is evaluated. The difference rja between the 
instants gives the time required for passage from the point (&, n, ¢) 
to the point (x, y, z) with velocity a. Expression (91) is generally known 
as the retarded potential. It should also be noted that the basic formula 
(89) has a simple physical meaning: it shows that the solution of non- 
homogeneous equation (83) with initial conditions (84) is the sum of 
the impulses w(x, y, z, t; r)dt derived from the presence of the term 
f(x, Y, z,t) and defined by equations (85) and (86). 

We now consider the non-homogeneous wave equation for cylindri- 
cal waves: 


eu au 
Ou? a(S 


+ $a) Heyt) (92) 


with zero initial conditions. Precisely the same method as above can 
be used to obtain the solution in the form: 


t 
u (x,y, t) = f w (x,y, 2) dr, 
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where w(x, y, z, t; t) satisfies the homogeneous equation 


w Ow Ow 
oe T d= + si | 
and the initial conditions 
Ow 
gi |. or Oe aa tgs f (=, y,) 


We finally get, on re (80) into account: 


(€, 9, T) 
u (£, y, t) = ae | f la dé dy] de (93) 
0 to[alt-1) 
[e = (£ — x)? + (n — y)°]. 


It may be noted that we have integration with respect to time 
in this last formula, which was not the case with (91), where depend- 
ence on time reduced simply to the dependence on time of the 
radius of the sphere over which integration was carried out and the 
dependence on time of the function f(x, y, z, t). In the linear case: 


a = oF a +j), (94) 
the solution obviously becomes: 
t x-+a (t—r) 
u (x,t) =; ÍS J f(E, T) dé] dr. (95) 
0 x-a (t-r) 


175. Point sources. We may suppose that the term f(x, y,z,/) in 
equation (83) differs from zero only in a small sphere with centre at 
the coordinate origin. On letting the radius of the sphere tend to zero 
and the intensity of the external force increase indefinitely, we obtain 
in the limit the (so-called generalized) solution of the wave equation 
with the presence of a point source, which starts to act at the instant 


t = 0 and which can have any law of action in relationship to time. 
We take 


f(x,y,z t) =0 for Vee +y te oe (96) 
and 
i Silay, z, t) dx dy dz = 42 (t), (97) 


where C, is a sphere with centre at the origin and radius £. We return 


to equation (91), where we shall assume at > Va? + y2 + 2. By (96), 
it is sufficient to carry out the integration over the sphere C,. As 
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€ — 0, r becomes equal in the limit to the distance from point (a, y, 2) 
to the origin, i.e. r = Vx? + y? + z2, and we obtain on taking (97) into 


account: 
u (x, y, 2, t) =+o(t—2) (98) 


(=P LP Te. 


Moreover, we must take u(x, y, z, t) = 0 for r > at, since the domain 
of integration in (91) for r > at does not include the sphere C for 
sufficiently small c. It may be noted that function (98) satisfies the 
homogeneous wave equation and has a singularity at the coordinate 
origin for any choice of twice continuously differentiable function w(t). 

In the case of equation (92), we proceed exactly as above and take 


f(z.y,t)=0 for Vr +y >e, 


and 
S S flay, t)dady = 27o(t), 
Ye 


where y, is a circle of radius € with centre at the origin. On returning 
to equation (93) and passing to the limit, we get the solution for a 
point source in the case of cylindrical waves as: 


u(z,y,t)=0 for at<o 
(o = Vx? + y?). 


The difference between (98) and (99) is similar to the difference 
pointed out in the previous section. Formula (98) shows that the in- 
fluence of a point source at the instant ¢ at the point (x, y, z) depends 
only on the intensity of the source at the instant t— (rja). In the case of 
(99), this influence is determined by the action of the point source for 
the interval of time from ¢ = 0 to t = ọja. 

In the linear case (94), after as usual putting 


+e 
J f(x, t) dx = o (t) and /(z,t)=0 for |x|>e 
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we obtain on passing to the limit in (95): 


e 
d a 


u (x,t) = f o (r)de tor |x| < at, 


(100) 
u(x, t)=0 for |a|> at. 


176. The transverse vibrations of a membrane. We have so fer considered 
the wave equations in a plane and in space in the absence of a boundary, and 
have thus only had the initial conditions in addition to the differential equation. 
Boundary value problems involving the wave equation either in a plane or in 
space are much more difficult than in the linear case. We shall consider the 
two particular cases of such problems in a plane which arise when the boundary 
is a rectangle or a circle. The wave equation in a plane will be interpreted physi- 
cally as the equation of the transverse vibrations of a membrane. 

We understand by membrane a very thin sheet which, like a string, does 
work only by extension and not by bending. If the membrane is under the action 
of a uniform tension 7’, and lies in the (x, y) plane at equilibrium, and if we confine 
ourselves to the case when motion occurs parallel to axis OZ, the displacement 
u of a point (x, y) of the membrane becomes a function of x, y and t, satisfying 
a differential equation analogous to that for a string, i.e. 

Ou 


Ge we ( Ge tage) tie, (101) 


a= |, 
e 
e is the surface density of the membrane, and of is the external force or loading. 
We shall not dwell here on the derivation of equation (101). 
Apart from differential equation (101), we also have to take into account 
the boundary conditions which must be satisfied by the function u on the bound- 


ary of the membrane (C). We shall only discuss the case when the contour (C) 
is rigidly fixed, i.e. 


where 


u=0 on (C). (102) 


Finally, we have to specify the initial conditions, i.e. the displacement and 
velocity of every point of the membrane at the initial instant: 


Ou 


u = Qı (2, y), On 


t=0 o7 ae y). (103) 


t 


177. Rectangular membrane. We consider the free vibrations of a rectangular 
membrane, the contour of which consists of the rectangle with sides 


æ = 0, x =l, y=0, y=m (104) 


in the (x, y) plane. We shall assume that external forces are absent, i.e. f = 0. 
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Here we have to find the solution of the equation 
Ou Ou Ou 
—— = Q? J-—— i 

ae T e (a3: + ar) 


satisfying conditions (102) and (103). 
On again applying the standing wave (Fourier’s) method, we seek a partic- 
ular solution of equation (105) in the form: 


(a cos wt + $ sin wt) U (x, y), (106) 


(105) 


which gives us 


— w? (a cos ot + B sin ot) U (2, y) = at (SE + 20) (a cos wi + $ sin wt), 


whence, on setting 
2 
w 


A =k, (107) 


a? 
we obtain the equation for U as: 


U , U ira 
ae tae te ktU =0. 
We seek in turn a particular solution of this last equation in the form 
U (x, y) = X (x) Y (y), (108) 
which gives us: 


X” (æ) Y (y) + X (x) Y” (y) + k? X (x) Y (y) = 0, 


X’) O YUEYU y 
X (2) F) l 


or 


where A? is an as yet undefined constant. 
We thus have the two equations: 


X” (x) + MX (x)= 0; PF" (y) + HY (y)=0, (109) 
where 
pes ht— at, ptt At =k. 
Equations (109) give us the general forms of the functions X(x) and Y(y): 
X (x) = C, sin da + 0, cosåæ; Y (y) =C,sin wx + C, cos po. 


The condition 
wed on (C) 


gives us 
U (@,y)=9 on (0), 


whilst this last condition in turn splits up into the following conditions: 


X(0)=0; X(()=0; Y(0)=0; Y(m)=0, 
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from which it is obvious that C, = O, = 0; also, if we neglect the non-zero 
constant factors C, and C3, we now have: 
X (x) =sindz, Y(y) =sin py, (110) 
with, however, 
sindl=0, sinum=0. (111) 
It follows from equations (111) that 4 and u have the infinite set of values 


on Tta 

Aes Ayy E EE = by: May eee Hg eed Ig => U a (112) 

On arbitrarily selecting a A and a u from sequence (112), we get for the 
corresponding value of the constant k?: 


o? T? 
h= tme H) 


m? 


and find from (107) the corresponding frequency w for this value of k?: 


2 2p? ape [ae 
wa,r = Ako, r = aa E the s (113) 


We substitute 4, for A and y, for u in expression (106) and write the cor- 
responding a and ĝ as a, ,, 8,, and hence obtain an infinite set of solutions of 
equation (105) satisfying boundary condition (102) in the form: 


; . OnE . TAY 
(ar, 1 COS Wg, q E F Bo, SIN Wg, yt) sin 7 n- 


i.e. the infinite set of proper (free) harmonic vibrations of a membrane corres- 
ponding to the free vibrations of a string. 

The constants a, 8 are determined from the initial conditions. Substitution 
of t = 0 in the expressions: 


eo 
; . OMe . TAY 
u = (ag, r COS wg, r t F Bo, r SIN wg, t) sin 7 mn’ 
o,7ț=1 
ðu C 3 . ong . TAY 
op > Og, r (Bo, r CO8 Og, r t — Ug,, SÌN Wg, ç t) sin 7 S 
o,t=1 


gives us on the basis of (103): 


= E . Onn . TAY 
u limo = Pı (BY) = 2 ag, Sin —7- sin — > 


522 THE PARTIAL DIFFERENTIAL EQUATIONS OF MATHEMATICAL PHYSICS [177 


These expressions are in fact the double Fourier expansions of the functions 
p, and p, and it may readily be seen that the coefficients a and £ are given by 


im 
4 . on , TR 
ag, = Jor | |e G4) sin E sin 7 agan, 
00 


(114) 
m 


l 
f [eE m sin E sint 
00 


an 
= dé dy, 


ae 
lm 


eo, Boys = 


which provides the solution of our problem. 

The case of a membrane differs from that of a string inasmuch as, for the 
latter, each proper frequency corresponds to a particular form of the string, 
which is simply split up by nodes into several different parts. It is possible with 
a membrane that the same frequency corresponds to several different shapes 
with different positions of the nodal lines, i.e. the lines on which the amplitude 
of vibration becomes zero. This can be studied most simply by considering a 


square membrane: 
t=m=r. 


The frequency w, , is defined in this case by 


wo, = VOTE = a VEFE, (115) 


where a = az/r is a factor depending neither on o nor rt. 
On setting o = r = 1, we get the fundamental tone u, of the membrane 
with frequency w, = a2: 


> _ mz. ny 
ay, = N, sin (wut + Pr) sin ~— sin ——+ 


Here, there are no nodal lines whatever within the membrane. 
On next putting 


o=1, t=2 or o=2, t=], 
we get two further tones of the same frequency 


Oy, = Oy, =a V5, 


given by: -> 


Be ve . næ . 2nay 
Uiz = N 2 SIN (W3 t + Pia) BiN T OnE 
: . 2G . n 
Ug, = No, Sin (Mg) t + Py) sin sin TE. 


The nodal lines of these elementary vibrations are respectively 


= o s=2 
STED TS 
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But in addition to vibrations u,, and u», an infinite set of further vibrations 
of the same frequency exists, obtained from linear combinations of u, and uz. 
On writing 9,2 = Pa = 0 for simplicity, we get a vibration of the form 

sin wt [sin = sin Zt + N, sin — shi sin sin Z], 
where w — w:0 a, N, = N, and N; = Na 

With N, = N,, the nodal lines are defined by the equation 


. me , 2 . 2ax Eg n nE n 
0 = sin — sin Y + sin — z Z sin TE = 2sin™ sin 74 (cos ZE + cos ZE), 


giving us the nodal line 


xsx+y=r. 
With N, = —N,, we find by precisely the same method the nodal line 
a—y=0. 


N, =0 N,=0 N2=N7 No=-N; 


w=x V5 


Fic. 134 


These elementary cases are illustrated in Fig. 134. We get more compli- 
cated nodal lines for the same frequency when N, # + N, and N,, N, £0. 
These all have equations of the form 


mx ui 
N, cos -5 +N, cos AH = 


On now setting 


we obtain the unique tone of frequency 
Wy, = a V8, 
the nodal lines of which are (Fig. 135): 


1 1 
s=5" and y=z" 


The next case: 
o=1, t=3, o=3, t=l 


again leads to an infinite set of vibrations of the same frequency ws = 3, = 
= a V10. Figure 136 illustrates their nodal lines in elementary cases analogous 
to those with frequency w,, = @,, = a Y5. All these figures are in fact the well 
known Chladni’s figures of acoustics. 
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The forced vibrations of a membrane are investigated in exactly the same way 


as the forced vibrations of a string, except that the external force f(z, y, t) 
is expanded in a double instead of a simple Fourier series. 


N,=0 No=0 N2=-N; M=N, 


w=x V8 w= «VIO 


Fig. 135 Fre. 136 


178. Circular membranes. The case of a circular membrane gives us an 
example of the expansion of a given function in Bessel functions; it is important 
inasmuch as similar expansions are encountered in other notable problems of 
mathematical physics. 

We investigate the free (proper) vibrations of a circular membrane with 
contour of radius J and centre at the coordinate origin. We assume as a pre- 
liminary that there is no displacement on the contour. On introducing polar 
coordinates (r, 6) instead of rectangular coordinates (x, y), we have now 


u \pmt = 0. 


As in the rectangular case, we seek particular solutions of equation (105) 
in the form: 
(a cos wt + B sin wt) U, 


where the function U is now assumed given in terms of (r, 6) instead of (x, y) 
We obtain the same differential equation for U: 


aU : 
eat +5 n US kU =0, (116) 
except that it has to be transformed to the new variables (r, 6). This is done 
simply by expressing Laplace’s operator 


eU 


AU = Sar = + oye Oy? 


(117) 


in polar coordinates. We know that the operator for three variables 


gU 


AU = ae tr +- oad 


is given in terms of cylindrical coordinates by [119]: 


wo alee (ete) +e oe tee 
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We express (117) in polar coordinates by taking U as independent of z. We 
shall in future denote the length of the radius vector as r instead of ọ, and the 
polar angle by 6 instead of 9: 


eu eU ZU 1 0U 1 U 


Ox? dy? or? r Or r2 06? * 
Equation (116) becomes: 


eu 1 aU 1 @U 
Oe tr Or te age TMU =O 


We seek its particular solution as the product: 
U (r, 6) =T (0)+ R (r) 


which gives: 


T (O)| R (r) +R (P) +R (| +- T 0 R (r) =O, 


or 
T (0) PR (r)+rR'(r)+krR(r) as 
T (0) Rr) + 
and so finally: 
T” (6) + 42 T (8) =0, (118) 
R' (r) + R' (r) + G -3) R(r) =0. (119) 


The general solution of equation (118) is of the form 
T (6) = C cos de + D sin 29, 
and since the sense of the actual problem implies that U must be a single- 
valued periodic function of 6 of period 2x, we can say the same of the function 
T(6), which in turn implies that A must be an integer. If we confine ourselves 


to positive 4, we have to take 4=0,1,2,...,”,..., the corresponding 
expressions for T(8) and R(r) being denoted by 


T (0), Ti (6), T, (8), ---, Py (0), ---3 Ry(r), Ry (r), Ry(r),..., Rar)... 
We thus obtain an infinity of solutions of (105) of the form 

(a cos wt + f sin wt) (C cos n8 + Dsinné) RB, (r) (œw = ak). (120) 
The function F,,(r) satisfies equation (119) if we replace the å there by n: 


at 1 - 2 
Re (7) +- Ri (r) + (r° -=) R,(r) =9. (121) 
As we saw in [49], the general solution of this equation is 


En (r) = Cp (kr) + O2Ky (kr), (122) 


where J,,(x) is the Bessel function and K,,(x) is the second solution of Bessel’s 
equation which tends to infinity at x = 0; since the nature of the problem implies 
that the required solutions remain bounded at all points of the membrane, 
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the origin r = 0 included, the term in K,(kr) must be absent in the above 
expression for R,,(r), i.e. C, = 0. We can take C, = 1 without loss of generality, 
ie. we put 


Ra (r) = Jpn (kr), (123) 
in which case the boundary condition 
; u |e = 0 
gives 
J, (kl) =0. (124) 
On writing kl = u, we have the transcendental equation for g: 
Jn (u) =0, (125) 
which, as shown in the theory of Bessel functions, has the infinite set of positive 


roots 
u, ug, u, sees u, iy (126) 


corresponding to the values 


ny 
kM), k, EO, «2.5 kD = ET (127) 
of the parameter k and, by (107), to the values 
Omn = ak) (n=0, 1, 2,..., m=1. 2,...) (128) 


of the frequency w. The first nine roots of the first six Bessel functions are 
given in the attached table: 


2.404 3.832 5.135 6.379 7.586 8.780 
5.520 7.016 8.417 9.760 11.064 12.339 
8.654 10.173 11.620 | 13.017 14.373 15.700 
11.792 13.323 14.796 16.224 17.616 18.982 
14.931 16.470 17.960 | 19.410 20.827 22.220 
18.076 19.616 | 21.117 22.583 24.018 25.431 
21.212 22.760 | 24.270 25.749 27.200 28.628 
24.353 25.903 | 27.421 28.909 30.371 31.813 
27.494 29.047 30.571 32.050 33.612 34.983 


COeTIAMA EP WN Ee 


Subsequent roots may be evaluated from the approximate formula 


1 án? — 1 
(n) = — a oa ae Ee, 
ku 1 n (2n — 1+ 4m) nOn IF im’ (129) 


the accuracy of which, for a given n, increases with m. We cannot enter here 
into the derivation of formula (129). 
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It follows from expression (120) that the particular solutions obtained by 
us can be written in the form: 


(aD „ COS mnt + aP p SiN Om, n t) cos n8 -Jp (k™ r) + 
+ (BD p COS Om, nt + B® , SiN Om, n t)sinn6-J,(k™r) (130) 
(m, n= 1, 2,...). 


We also notice that equation (118) with 4 = 0 has two solutions: a constant 
and 6. The second solution is unsuitable since it is not periodic. Expression (120) 
gives in the first case the solution: 


i ; 
(a | COS Om, o t + a2 Sin Wm, o t) Jo (kr). 


This solution also has the form (130) (with n = 0), the only difference being 
that with n = 0 the second term in (130) vanishes due to the presence of the 
factor sin n8. 

It only remains for us now to satisfy the initial conditions: 


Ou 


U limo = Pr (r, 0); E T Pa (re). (131) 


te 


To this end, on bearing in mind the particular solutions arrived at, we seek 
u.as the double series: 


co 
u(r, 0, t) = > (a, COS Om, nt + a2 sin Om, nt) Cos 20+ Ty (kr) + 


Om,n (a n COS Om, nt — AD „Sin Om, nt) COS NO + Ty (kr) + 
o 4 

© 
+ > Wm,n (BY n COS Om, nt — Ba 1 81D Om, n t)sin nð- Jn (k r), 


n=0 
m=1 


and setting ¢ = 0 in these expressions, we arrive by (131) at the point of having 
to expand the given functions 9,(r, 0) and ,(7, 9) as double series of the forms: 


pi (7,8) = X (av , cos nd + BY sinne): J, (kr), 
mar 
7 (132) 
Pa (7,0) = > Om, n (a9 , cos no + BQ) „sin ne) -Jp (km r). 


n=0 
m= 


528 THE PARTIAL DIFFERENTIAL EQUATIONS OF MATHEMATICAL PHYSICS [178 


Expansion of 9¢,(7, 9) as a periodic function of 9 in an ordinary Fourier series 
gives us 
(1) 


pı (r, 8) = Ts + > (pP cos nd + yy sin no), 
n=1 


where 


n n 
pw = > fo (r, 0) cos n0 dé; pW) = L fo (r, 8) sin nO dð (n=O, 1,2, ...). (133) 


— 7 -x 


On comparing this expansion with the first of expressions (132), we readily 
obtain: 


og =2 So RN: p= SS an hr) | 
OR oe (134) 
uP = X Ppa Tepr). | 


m=) 


Coefficients ¢ and y? obviously depend on r, as indicated by their expres- 
sions (133). We thus arrive at the problem of expanding a given function of 
r in a series in J,(kKr) with fixed n. On obtaining these expansions, we can 
determine the coefficients a and £, and the problem is solved. 

Thus, let it be required to expand a given funetion f(r) as 


fn= > A mJ (kr). (135) 
m=1 


We assume that this expansion is possible and that it can be integrated 
term by term, so that we only have to show how to find the coefficients Am. 
To this end, we prove that the functions 


Tn (Hr), Ty (Wr), oe Ta (WPT), o 


possess the property of generalized orthogonality, i.e. 


L 
f Inr) Tn (kr) rdr=0 for o#r. (136) 
0 


In fact, if we substitute KM" then & for k?, and similarly J,(k r) then 
Jn(k@ r) for R,(r) in equation (121), we get: 


a? J, (kr) 1 dJ, (kr) n? 


d? Jp (kr) l dJa (km r) n? 
pe SE (eee T) Ja (err) = 0. 
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On multiplying the first equation by rJ,(k@™ r) and the second equation by 
rJ,(k r), subtracting, then integrating from 0 to l, we get: 


I 
(kon? — km?) [7r (AO r) Jp (kM r)r dr = 


d? Jp (ko d? J, (k™ 
-f | i 3) J (kr) — ae Lid) J, (km | rdr + 


dJ p (k® r) 


ir Jn (kr) — = J p (ko n| dr 


dJa (k™ r) 
+f [2 


We have by integrating by parts: 
f d? J, (k™ r) dT, (k 7) 
dr? dr 
Í dJ Ma r) d [rn EPA _ a n(k@r) 
z dr dr 


af es ad, (kr) rar — {Sia dJa a r) 


Jn (kum r)rdr = rd» (km r) — 


tJ, (kr) — 

. (n) , 
ar dF Jn (kr) dr 
and similarly: 


Í d? J, (kr) 


dr2 


Ie dJa en. dJa GP» na Si a(k r r) 


dJa (km r) 
dr 


J, (kr) rdr = rJp(k® r) — 


J, (kr) dr. 
We easily deduce from the above that 


l 
(E — K) S Ja (EP r) Sal RQ) rdr = 


dJ p (k™® r km 
=r pa ) Ja (Kk 7) — Teer d Jp (em Dy 
We have by definition of Kk”, k™: 


Jn (I) = J, (ROY = 0, 


r=1 


whence it follows that the right-hand side of the equation written vanishes 
for r = l. In view of the presence of the factor r and the finiteness of J,,(x) and 
J (x) for x = 0, we can say that the right-hand side also vanishes at the lower 
limit r = 0; but now it follows from the fact that kM) 4 k® for o # t that 


l 
S Ja (KO 71) Tq (kr) r dr = 0 
0 


which is what we wished to prove. 
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Having proved (136), the determination of the coefficients A,, in expansion 
(135) presents no difficulty: we multiply both sides of equation (135) by 
Tn (kh) r), integrate from 0 to l with respect to r, use formula (136), and find 
directly that: 


1 l 
J 1) Ja (em r)r dr = Apf vi (EP r)rdr. 


Hence we can say that, if expansion (135) exists and is integrable term by 
term, the coefficients A,, are given by: 


I 
SEO) Tn (R@ r) r dr 
a 


Am 7 
f Jn(k®r)rdr 
0 


Expressions (133) and (134) now give us the following formulae for the 
coefficients a) and fp): 


l 
i S ofp Jo (kg) 7 ar i PIES 
aD = yp = Jaf rn (r 8) Jo (ier) rdr 
J2(k r)rdr 27 f J2 (kOr)}rdar 77 0 
0 i 0 ti 
1 n l 
aD = = f ao f Pı (r, 8) cos n8J p (k r) r dr 
n f Jà(k®r)rar 77 j 
0 
l x l 
po = — § do f p, (r, 0) sin noJ,, (k™ r) r dr. 


n f JR(K@r)rdr = © 
0 


The same arguments can be used for determining the coefficients a), Be); 
it is only necessary to replace ¢, by ¢, in the above formulae and divide the 
corresponding expressions by wms n- 

As in the case of a rectangular membrane, the general motion of ẹ circular 
membrane consists of the addition of an infinite set of proper harmonic vibra- 
tions, it being possible for the same frequency to correspond to an infinity 
of different dispositions of the nodal lines. Several dispositions of the nodal 
lines are illustrated in Fig. 137, along with the corresponding frequencies, the 
fundamental frequency being taken as unity; the radii of the circular nodal 
lines are also given, as fractions of the radius of the membrane. 

When we apply Fourier’s method for the case of any contour, we can only 
separate out the factor depending on ¢ in (106), which leads to the equation: 


U , PU 


te t g tU SO (137) 
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and we have to find the values of the parameter k for which the equation written 
has non-zero solutions which also satisfy the boundary condition (102). We 
succeeded in doing this in the above examples with the aid of further expansion 
of the variables. This method cannot be used in the general case, and we have 
to consider equation (137) directly. Of course the problem does not have an 
explicit solution. The theoretical solution and some relevant qualitative results 
are given in Volume IV. The boundary value problem for the wave equation in 
threedimensional space in the case of a rectangular parallelepiped is solved pre- 
cisely as in [177], except that we arrive 

at Fourier series in three variables x, y, z 


The case of a sphere again leads to Bessel 
functions. We shall discuss this in Vol- 
ume III in connection with a more de- 


tailed treatment of the theory of Bessel 1.000 1504 


2,136 
functions. 
A detailed study of the convergence 
of the Fourier series obtained in the 
solution of boundary value problems for 
the wave equation in the case of several 
spatial variables is given in Volume IV. 2.296 2.653 2.918 
179. The uniqueness theorem. We 
now prove the uniqueness of the 
solution of the wave equation both 3.156 3.501 
in the case of unbounded space with Fie. 137 


given initial conditions, and with 
the further imposition of boundary 
conditions. For simplicity of writing we shall take the velocity a = 1, 
which is permissible on replacing the ¢ in the wave equation by at. 
We take the case of three independent variables for the sake of defi- 
niteness, i.e. the wave equation 

Ou Ou Ou 

Ot = Gat Oye” oe) 
and we start by considering the problem with initial conditions only, 
given throughout the (x,y) plane: 


Ou 
Ulizo = P(% Y); F l;o = Pi (5 Y) (139) 
The solution of this problem has already been described [172], and the 
method used could itself serve for proving uniqueness. Here we shall 
give a different proof, which is also applicable to problems with 
boundary conditions. If equation (138) with initial conditions (139) has 
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two solutions, u, and u, their difference (u, — u,) must satisfy (138) 
and the homogeneous initial conditions: 


ðu 
ulino = 0; | = 0. (140) 
We now have to show that u must be identically zero for any (x, y) 
and for any t > 0. We consider a three-dimensional space (xv, y, t) 
and take a point N(x, Yos to) of the space such that t > 0. From this 
point as vertex, we draw the conical surface 


(£ — xo)? + (Y — Yo)? — (t — ty)? = 0 (141) 


to its intersection with the plane t = 0. We also draw the plane t = 2, 
where 0 < t, < to and let D be the three-dimensional domain bounded 
by the lateral surface I’ of our cone and the pieces of the planes t = 0 
and t = 4 lying inside the cone (D is a section of a cone). We can easily 
verify the following elementary identity: 


Ou (du Ou Ou) Of au)\? , (du)? , (du)? 
2a (ar — car me) = wae) + (ae) + Ce) ]- 
0 (du du ð (ðu du 

~25 ae) 2a lea) (142) 
We integrate both sides over the domain D. The integral of the left- 
hand side must vanish since uw is a solution of (138). We can use 
Ostrogradskii’s formula to transform the integral of the right-hand side 
to an integral over the surface of D: 


ICI + CY + Bey Joos cm, 0 — 


Ou du Ou Ou 
— 2 -ar Gy 008 (n,x) — 2 Oi Oy cos (n, yi ds. (143) 


The function u and all its first order partial derivatives are zero by (140) 
on the lower base of the section of cone D and integral (143) vanishes 
over the lower base. We have on the upper base oa: 


cos (n, x) = cos (n, y) = 0 and cos (n, t) = 1, 


On the lateral surface of the cone T the direction-cosines of the normals 
satisfy the relationship: 


cos? (n, t) — cos? (n, x) — cos? (n, y) = 0, 
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and we can rewrite integral (143) over I in the form: 
1 Ou Ou 2 
7 JJ “cos (r, t) le cos (n, t) — Ft cos (m, æ) | + 


oe 


+ [ By cos (n, t) — i cos (n, nf} ds, 
so that we finally get 


ORORO 


We have cos (n, t) > 0 and consequently J > 0 on the surface I, 


so that i , 
HORORI 


from which it follows that at all points inside the total cone with 
vertex N (£o, Yo, to) the first order partial derivatives of u vanish, with 
the result that u is constant. It is zero on the bases of the cone by (140), 
and it is therefore also zero at the point N. This proof of the uniqueness 
theorem can readily be extended to the case of a boundary problem 
with equation (138). Let the solution of (138) be sought in a domain B 
of the (xv, y) plane with given initial and boundary value conditions, 
the boundary conditions being in reference to a contour J of domain 
B. We construct a cylinder with base B and generators parallel to thet 
axis, Every point of this cylinder corresponds to a definite point of 
domain B and a definite instant t. We suppose that we have the zero 
initial data (140) in B and that on the contour l of B we have the 
homogeneous boundary condition: 


We show that the function u is zero at all points of our cylinder. 
We take one such point N and draw the cone (141) through it. Let D 
be the solid bounded by the lateral surface of this cone, by our cylinder 
and by the planes t = 0 and ¢ = ¢t,. We again integrate both sides of 
identity (142) over this domain. All the arguments remain the same 
except for the appearance of the integral over the lateral surface of 
the cylinder on the right-hand side. If this integral can be shown to 
vanish, the above proof of the uniqueness theorem can be retained 
complete. The integrand in this integral coincides with the integrand 
of (143). But we have cos (n, t) = 0, and furthermore, du/dt = 0, on 
the lateral surface of the cylinder. The last equation follows at once 
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from the fact that points of the lateral surface of the cylinder consist 
of points of the contour / at different instants t, whilst we have the 
homogeneous boundary condition (144) on the contour J for any t. 
The integrand of (143) thus vanishes at all points of the lateral surface 
of the cylinder, and the above proof of the uniqueness theorem is fully 
preserved for the boundary value problem. During the course of the 
proof, we integrated the right-hand side of expression (142) over the 
domain D and applied Ostrogradskii’s formula. These operations are 
entirely justifiable if we assume that the function u has continuous 
derivatives up to the second order, which remain bounded inside the 
domain D. l 

We mentioned above that the investigation of problems of practical 
interest forces us to bring in so-called generalized solutions. We shall 
show in Volume IV that the uniqueness theorem is also valid for 
this wider class of generalized solutions. 


180. Applications of Fourier integrals. We take the wave equation in the 
linear case: 


Pu Ou 
“ar = ar (145) 
for the semi-infinite domain x > 0, with the initial conditions 
u 
u |= = P (2) IE = Y (x) (x > 0) (146) 
t=0 
and the boundary condition 
u |x- = 0. (147) 


This problem may easily be solved by the method indicated in [166]. In 
fact, it is sufficient to make an odd continuation of functions g(x) and ¢,(x), 
assigned in the interval (0, -+ oo), into the interval (— oo, +- oo), and afterwards 
to apply formula (17) for an infinite string. We obtain by setting x = 0 in 
this formula: 


+at 
u \ =o = Li Oh pele) +Í Pı (z) dz, 


—at 


and both terms vanish in view of the odd continuation of p(z) and p,(z), so that 
the boundary condition is in fact satisfied. 

If we apply Fourier’s method to the above problem, we get a Fourier integral 
instead of a Fourier series. As we saw in [167], application of Fourier’s method 
with the boundary condition taken into account leads to a solution of the form: 


u = (A cos akt + B sin akt) sin ke. 
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There is no second boundary condition, so that all values are permissible 
for the parameter k, i.e. we arrive at a continuous spectrum of possible frequen- 
cies k of the semi-infinite string. Instead of summing over discrete values of 


k as in [167], we now have to integrate with respect to the parameter k, A 
and B being of course taken as functions of k. We obtain: 


u (z, t) = { [A (k) cos akt + B (k) sin akt] sin ke dk. (148) 


The functions A(k) and B(k) must be found from the initial conditions (146) 
these give: 


œ Ło 
@ (x) =f A(k)sinkx dk; g(x) = f ak B (k) sin kx dk. (149) 


-v 


On comparing these formulae with Fourier’s formula for an odd function: 


+0 co 
f (2) == | J f@sin wat sin az da, 
-0 0 


we obtain for A(k) and B(k): 
A (k) =4 fp (&) sin kë dé; Bwy= -iy fa (&) sin ké dé, 
0 0 


and substitution in (148) gives us the solution of the problem as 


+œ œo 
u (x, t) = + [f G (E) cos akt + an (£) sin art] sin ké sin ka as} dk, 
-%0 0 


or alternatively, if we take into account the evenness of the integrand considered 
as a function of k, as: 


u (2, j= jif po Fn (E) sin akt | sin z ag} sin ke dk. 
0 0 


It may readily be seen by using Fourier’s formula that the right-hand side 
of the above expression coincides with the right-hand side of (17), on the 
hypothesis that p(x) and 9,(x) are odd. 

In precisely the same way, we can consider a boundary value problem for 
the equation 

Ou 3u Fu 
ae = * (ar t a) 


on the half-plane y > 0 with the boundary condition 


u |yao = 0 (150) 
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and with any initial conditions: 


ðu 
U lt=0 = P (T, Y); © |, = p, (x, y) 
=o 


(~œ < gz < + %; y>0). (151) 


We can easily verify that the solution of the problem is given by formula 
(80) on condition that p(x, y) and 9,(z, y) are continued oddly with respect to 
the argument y into the interval (— oo, 0). In fact, with y = 0, the first term of 
(80) can be written as 


x+at [+Vatt#—(a—x)* 
Pı (a, B) 


270 at? — (a — x)? — f2 
on yne l (a — 2)? — p 


dB | da 


and the inner integral is zero for any x and t, since the integrand is an odd func- 
tion of f. Similarly, the second term of (80) also vanishes, so that condition (150) 
is in fact satisfied. We might equally have used Fourier’s method for the 
problem, the functions of two variables being represented by Fourier integrals. 
The proof of the identity of the solution thus obtained with the solution given 
by (80) presents greater difficulty than in the linear case. The wave equation 
can be considered in a similar way in the semi-continuum z > 0 with the bound- 
ary condition u = 0 for z = 0. We may also use Fourier’s method for solving 
the wave equation in the unbounded case, when there are only initial conditions. 
This leads to more complicated working, however, than that carried out above. 


§ 18. The equation of telegraphy 


181. Fundamental equations. Either of the methods explained 
above — that of characteristics (d’Alembert’s) or that of standing 
waves (Fourier’s) — may also be successfully applied in the case of 
the equation of telegraphy, which has a basic significance in the 
theory of the propagation of quasi-stationary electric vibrations along 
cables. 

Let us have a circuit made up of out-going and return conductors 
of length l. We shall assume that the ohmic resistance R, self-induc- 
tance L, capacity C and insulation leakage A, all per unit length, are 
uniformly distributed throughout the circuit, so that the present case 
differs from that discussed in [I, 181], where we had resistance, self- 
inductance and capacity lumped at different points of the circuit. 
Let v and į denote the voltage and current at the section of circuit 
distant x from the end x= 0. These functions of x and t are connected 
by two differential equations which we shall now derive. 
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The law of induction says that the voltage drop over an element dz 
of the circuit: 


v — (w + do) = — do = — 2 de 


is found by adding the ohmic drop R dz » i and the inductive drop 
L dx 9i/dt, or in other words, if we divide by dz: 


wipes R=. (1) 


Further, the difference between the currents entering and leaving 
the element da, i.e. 


Ži ir, 


i— (i+ di) = — di = — ṣ 


is obtained by adding the charging current C dx 6v/dt and the leakage 
current A dg v, so that 


ði dv 
Ge bt Oa + Av=0. (2) 


The boundary conditions that have to be satisfied at the ends of the 
circuit are extremely important. If one end of the circuit is open, we 
must have at this end 


i=0 (for x=0 or w=l). (3) 
In general, if an external electromotive force Æ, a resistance r and 


a self-inductance A are connected to one end of the circuit, we must 
have at this end: 


v= E+ritad (for x=0 or e=l). (4) 

In particular, if a voltage E only is applied to the end x=0, whilst 
the other end x = lis short-circuited, we have 

vlo= E, vhe = 0. (5) 

182. Steady state processes. We shall first say a few words about 

the steady state processes, when the external factors operating on the 

circuit are either (1) constant or (2) sinusoidal, v and ¿ being taken as 


independent of ¢ in the first case. 
Equations (1) and (2) give us in the first case: 


. di 
Ge tRi=0; G+ Av=0. (6) 
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On differentiating the first of these equations and making use of 
the second, we get 


d2y 


<r — RAv=0. (7) 


The function v is given at once by the method indicated in [27], 
and we find 


v (x) = Cye™ + O,e~™, (8) 
where 
b = VRA. 
Having found v, we get i from the first of equations (6): 
i (£) = a n = — Ž (Oye — C,e—*), (9) 


Examples. 1. We have conditions (5) in the case of a circuit with a constant 
voltage H at one end and shorted at the other; hence we can determine the 
arbitrary constants appearing in equation (8): 


O,+0,=E; 0, 4+0," =0, 
whence 
E Ee—™ g =Z” 
el otl — 9 il’ 2 i 


0,=— 


go that we obtain on substituting in (8): 
o%f-*) — eH) sinh b (I — 2) 


v (x) = E —_________ = E H, 10 
@) et! — e7”! sinh b? (01) 
and equation (9) gives 
cosh b(/ — 2) 10,) 
i (2) = n|/(4) “sinh lb (10; 


2. Now let an external sinusoidal electromotive force of given frequency 
w act on our circuit; we can convert the physical quantities acting here into 
vectors, as was done in [I, 180], whilst we shall understand by forced vibrations 
sinusoidal oscillations of voltage and current in the circuit of the same frequency 
w. On recalling the rules of [I, 180] and introducing the current vector I and 
voltage vector V, which depend on x in the present case, we can transform 
the system of differential equations (1) and (2) into: 


SV _ (r+ iw) 1 =0; 4 (4 + iat) V=0. (11) 


On differentiating the first of these equations with respect to æ and using 
the second to eliminate I, we get: 


d?V 


-izr — (E iwL) (A + iw) V = 0, 
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whilst precisely the same form of equation is obtained for I, as may easily be 
shown. 
Thus, I and V are solutions of the same second order differential equation. 
On using the method of [27] and setting 
(R + 4@L) (A + iw0) = x’, (12) 
we have 
V = A” + Ao”, (13) 


where A, and A, are arbitrary constant vectors. On substituting back into the 
first of equations (11), we find for the vector I: 


| 1 dv _ [A + tw0 


R+ioL + ioL “de R + ioL (A,e~* — A,e™). (14) 


The final solution of the problem requires the determination of the con- 
stant vectors A, and A,, by making use of the two boundary conditions (of 
course there is no question here of initial conditions); here, instead of assigning 
a condition for each end separately, two conditions may be stipulated for the 
same end, saying by giving the voltage and current vectors there. 

However this may be, equations (13) and (14) define the vectors of the forced 
oscillations which depend on zv, i.e. they vary both in amplitude and phase along 
the circuit. On representing each vector (m + ni) by a point on the complex 
plane and letting x vary from 0 to l, we get two curves for V and I, the vector 
diagrame of the voltage and current. It must be recalled when finding the shape 
of these curves that x is in general a complex number; on writing 

x =a + ib, 
we have: 


V = Aje™ (cos bx +4- i sin bx) + Ae ™ (cos bæ — i sin ba). 


Each term on the right-hand side gives a spiral [I, 183], and V is obtained 
by “geometric addition” of these spirals; the radius vector of a point of the 
curve for V, corresponding to a given value of v, is equal to the geometric sum 
of the radius vectors of the points of the two spirals with the same value of 
æ. We can say the same as regards the vector I. On introducing the factor 


_ |R+ toL 
= ZHZ. (15) 


which is known as the wave impedance, we can write the expressions for V 
and I as 


V= 4e" 4+ Ao %; I= + (A,e-** — A,e™*). (16) 


If we pass from the vector to the ordinary form, we can write expressions 
for the required functions v and ¢ of the type: 


v= V (x) sin [wt + y (x)j; i= {I (x) sin [wt + x (x)j, (17) 


which give the harmonic oscillations of the same frequency w as the external 
force, the amplitudes V(x) and I(x), and phases y(x) and z(x) of these being 
dependent on the position of the section of the circuit in question. 
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3. The circuit with sinusoidal voltage at one end and open at the other end. 
Let V, denote the voltage vector given at the end x = 0. In addition to equations 
(11), we have the boundary conditions: 


V [xo = Vos I [xer = 0, 
which give us, by (16): 
A, +A =V; A7” — Ao” = 0. 
On solving these equations and substituting in (16), we find 


__y cosh x (l — x) _ Vo sinh x (J — 2) 
Vests osha E E “y cosh x 


With x = 0, we obtain the complex impedance at the point x = 0 as 


cosh xl 
sinh xl ` 


=y 


Po 


183. Transient processes. We propose to compare the two types of 
forced oscillation, (I) and (II), in the same circuit with the operation 
of different external factors. The voltage and current of the type (I) 
oscillation are written »,, ù, and those of type (II) by va, tz- 

If we suddenly replace the external conditions leading to oscillations 
(I) by those with which type (II) must be obtained, instead of the 
transition taking place immediately from (I) to (II), a greater or lesser 
period of time must elapse, during which free oscillations (or transients ) 
are excited in the circuit. The transitional period can be infinite in 
theory but is finite in practice. The transients are characterized by 
their voltage v and current i, and we shall assume that the transitional 
state of the circuit is obtained by adding free damped oscillations to 
state (II), i.e. the voltage and current of the transitional process are 
defined by the sums 

v+u; iti (18) 


At the start of the process, ¢ = 0, these sums must reduce to v 
and 7,. The functions v and i must satisfy differential equations (1) 
and (2) [181] and boundary conditions (3) or (4), depending on the 
conditions imposed at the ends. Above all, they have to satisfy initial 
conditions as well, of the form: 


v |= = (V1 — V) l= = g (x); 


T (19) 
i limo = (i — îz)t=0 = Jir (x). t 


t The factor ¥C/Z is introduced in order to simplify later working. 
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Instead of seeking v and i directly, we shall express them in terms 
of a new unknown function w, where we take 


v= > 


Ox ` 


Equation (2) now gives: 
ai aw ow ə. w 
a tOey t Ae = a lt OG t Au) = o, 
whence 
i+0® 4 Aw =o, 


where c is independent of x. We can in fact take c = 0 without loss 
of generality, since we can add an arbitrary term not depending on % 
to w without changing v = dw/dz. 

We thus have: 


v=, i= 0 — Aw, (20) 


ðw 
“Ox 
and equation (2) is satisfied. On substituting (20) in equation (1), we 
get an equation which must be satisfied by the function w(x, t), i.e. 


a E-ra (CH + 4) — R(CH + Av) =0, 
or 


w 
“Oa? 


=o a — (LA + ROE — RAw =0. (21) 


This is known as the equation of telegraphy. 
We simplify this by introducing a new unknown u(x, t) in accordance 
with the formula: 
w (x,t) = et u (x, t), (22) 


whilst agreeing to choose the constant factor u in such a way that the 
terms containing du/dt fall out in the equation for u. We have after 
differentiating and cancelling out e~“': 


0 0 0 Ou 
sar — LC (uu — 2u Se + z) —(LA+RC) (- pu + a) —RAu=0, 
so that our purpose is achieved if we take 

2uLC — (LA + RC) =0, 


_ LA+RO 
=— z0 ` (23) 
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If we substitute this value for u and carry out some simple re- 
arrangements, we get the equation for u: 


Ou l1 Gu 
et = Lo tr + Ou (24) 
where 
g= FAP, 
= 350 


We first distinguish the case when 6 can be neglected or is strictly 
Zero, 1.8. 


R A 
To (25) 
In this case 
R 
p=, (26) 
and on setting 
1 
I0 =a , (27) 


@u 5 Ou (28) 


which we investigated above. 
The general solution is [164]: 


u (x,t) = 0, (x — at) + 8, (x + at), (29) 
and the constant a = Y1/LC gives the velocity of propagation of the ex- 
citation along the cable. Equation (22) gives: 

w (x,t) = e™™t [0, (x — at) + 0, (x +- at)], 
and finally, we obtain from (20): 

v(x, t) = 22 = e— [0] (a — at) + 65 (2 + ad), 
i(z,t)=—C a — Aw = — e™t [— a06; (x — at) + a09; (x + at) — 
— u09, (x — at) — u00, (a + at) + AB, (x —at) + A0, (£ + at)] = 

= aCe" [0i (x — at) — 0; (a + at)], 


since it is obvious from (25) and (26) that uC = A, and the remaining 
terms go out. Instead of the arbitrary functions 0, and 0, it is more 
convenient to take directly the functions 


pı (x) = 0i (x) and 9, (x) = 4 (x), 
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after which we obtain the final expressions for v and i: 


v (x,t) = et [p (£ — at) + p, (x + at)], 
eT”! (30) 
i (æ, t) = 2 [p, (w — at) — p, (£ + at)], 


where we have put a = YZL/C for brevity. These are the expressions 
that we shall utilize. The functions q(x) and g, (x) are deter- 
mined from the initial conditions (19), which give us: 


pı (2) + pa (x) =g (z); p(x) — pı (£) = h(x), 


whence 


— g(x) +h(z) , 


p(z) = HITRE, 9 (a) = LPO), (31) 


We could reckon the problem solved if the functions g(x) and h(x), 
or what amounts to the same thing, p, (x) and p, (x), were specified 
throughout the interval (—°°, +-°°); but in fact, we only know these 
in the interval (0, 1), and in order to be able to make use of the solution 
obtained we have to continue them outside this interval. We can do this 
with the aid of the boundary conditions as in the case of a string, and 
the physical meaning of the continuation also amounts here to a wave 
reflection occurring at the ends of the circuit. 

The phenomena corresponding to solution (30) are analogous to 
those worked out above for a string. We have two waves here, the 
direct and the reverse, which are reflected on arrival at the ends. The 
essential difference between this case and that of a string is the presence 
of the factor e~“', which diminishes with time and produces the damp- 
ing of the oscillation; the greater the exponent u (the logarithmic 
decrement of the damping), the faster the damping. 


184. Examples. If the end x = l is open, the condition 


tx. = 0 
gives us by (30): 
P: (l + at) = p, (l — at) 


or on replacing at by æ: 
Pa (l + T) = p, (l — 27), 


i.o. the wave is reflected without change of magnitude or sign at this end, since 
p(x) is an even continuation of p,(x). The same is evidently true if the open 
end is at z = 0. 

If the end x = 1 is short-circuited, i.e. 


vj =O, 


544 THE PARTIAL DIFFERENTIAL EQUATIONS OF MATHEMATICAL PHYSICS [184 


we obtain on taking (30) into account and replacing at by x: 
Pı (l + 2) = — 9, (l — z), 


i.e. the wave is reflected with a change of sign but with the same absolute value, 
since p(x) is an odd continuation of p (x). The further continuation is obtained as 
in the case of a string. 

1. The circuit open at one end has a harmonic alternating current of frequency 
w switched into it. The harmonic oscillations deduced above [182] correspond 
to the final steady state (II): 


v, = V (a)sin[ot+y(x)]; i, =I (x) sin [wt + x (a)]. 
If the circuit was empty before the inclusion of the current, we have: 
%=0, à =0. 

Hence the initial conditions become, by (19): 
v limo = — V (a) sin y (z) = g (x) , 
i leo = — T (x) sin x (x) = h(a). 

The boundary conditions are as follows. At the end x = l we must have: 

i [xmp = 0. 


We can take at the end x = 0: 
v |x = 0, 


since in the present transient process we are only interested in the oscillations 
that are due to the difference of the initial conditions of the circuit from the 


“4-3 -2l -4 GO 4 2 KH 41 Sl ôl 


Fra. 138 


forced oscillations of frequency w. We define 9,(x) and p(x) in accordance 
with (31) and then continue these, in an odd manner past the end z = J, and 
evenly past v = 0. 

2. We consider the damped process due to the initial conditions 


mom — E; tho = 9, 
where Æ is a constant, and with the boundary conditions 


vlw = 9 5 i l= 0. 


Equations (31) give 
1 
pı (x) = p, (£) = -7E where O0<gr<l, 


and we find from the boundary conditions: 


pı (— 2) = — p (£);  pı(l— a) = p, (142), (32) 
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from which it is clear that p(x) is an even continuation of p,(x) in the interval 
(l, 2l), whilst p (x) is an odd continuation of p(x) in the interval (—1, 0), i.e. 


9: (0) =— = for 0O<2<2l 


ŞE for —l<s<0 


Pı (1) = 1 
-7 for O<e<l. 


On writing (l + x) for x in the second of equations (32) and comparing the 
equation obtained with the first of (32), we get: 


pz (2l + x) = — p, (2), 
and similarly, 
pı (21 — x) = — 9, (— 2), 


i.e. p (x) and p(x) change sign on addition of 22 to the argument, so that their 
period is 41. 

On putting together everything that has been said, we see that ¢,(x) and 
p(x) are in fact the same, their graph being illustrated 
in Fig. 138. V+Y, 

We obtain v andi by moving this graph to the 
left and right with velocity a; vis half the sum of the 
ordinates multiplied by e~“‘, and ¢ is half the difference 
of the ordinates multiplied by e~“"/a. 

The graph of the voltage at the end x =Z} is shown 
in Fig. 139, the steady state v, = E being added to 8 T 2 3ST 
the free oscillation v. The r = 4l/a denotes the period F 

s 1a. 139 
of the free oscillation. 

If we have ohmic resistance r,, self-inductance å; 
and capacity y; at the end x =1, conditions (4) give the following relationship 
for continuation of p(x) in (l, 21): 


-ut 
oH [p 0 at) + pa 0+ a0] = [ri + a-i) E at-a gan. (33) 


On replacing the argument at by xv, this leads to a differential equation for 
the unknown function: 


® (x) = p, (l + 2) 0<2<l. 


We get a similar result for the continuation of ¢,(x) in(—1l, 0) by applying the 
boundary condition at the end x = 0. 

3. The circuit is terminated at x = l by the pure ohmic resistance r;. Equa- 
tion (33) now becomes: 


—pt e Ht 
e7” [p (2 — at) + p: (l + at)] =r 


a [p (l — at) — pa (l + at)], 
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whence we find p(l + x) after writing x for at: 


Ty — a 
1 +a 


The wave is thus multiplied by the factor q on reflection at the end « = 1 
in the present case. Obviously, | g | < 1, i.e. the absolute value of the wave 
diminishes or remains constant, with absorption occurring in the former case. 
The factor vanishes r; = a, and total absorption of the wave takes place. With 
ri = ©, q = l and we get reflection of the wave without change, which is 
obvious since this case is equivalent to the open circuit case. 

Having thus continued p(x) into (J, 21), and correspondingly ,(x) into (—i, 0), 
we can continue p,(x) into (21, 31) in accordance with (34), and so on. 

Of course we no longer obtain a periodic function in this case, and assuming 
|g | < l, subsequent reflections lead to stronger and stronger absorptions. The 
function p(x) is now defined for x > 0, and 9,(z) for x < l; but this is just 
what we need, since the arguments (l — at) and (l -+ at), on which p,(x) and 
p(x) depend, in fact satisfy these inequalities. 


Ppa (l + x) = gp, (l — 2), q= (84) 


185. Generalized equation of vibration of a string. We have con- 
sidered the equation in the particular case 6 = 0. Before turning to 
the general case, we investigate theoretically the generalized wave 
equation in the linear case: 


ay og Ov Ov dv 

a T ga T aa T O, (35) 

where we take the first coefficient a? as positive and the remainder of 

arbitrary sign. We replace v by a new required function u in accordance 
with i 

v = et BX y (36) 


and show as above that an a and £ can always be chosen so that terms 
containing first order partial derivatives fall out in the equation for u. 
We substitute (36) in equation (35), cancel out e“ + and collect like 
terms, and arrive at the equation 


ə? au ðu ` Ou 
ae =0 g F (a, +2a?ß) Ja T (a, — 2a) -5r + 


+ (as + ap? + ab + aa — a?) u, 
or, after setting a = a,/2, p = —a,/2a?: 


Ou u 
T = qa? J + eu, (37) 


where the coefficient c? can be either positive or negative, i.e. c is to be 
reckoned either positive or pure imaginary. 
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We shall solve equation (37) for an infinite x axis with the initial 
conditions: 
ðu 


pages Or == (x). (38) 


“| 


t=0 


Instead of solving directly the problem posed by (37) and (38), we 
attack the problem defined by the following equation and initial 
conditions: 


Pw Pw Ow 
a T de- + i) (39,) 
—o 3 = cyla . 
wl = 0; OE es = w (x) e's , (393) 


We can write down the solution of this immediately on using ex- 
pression (80) of [172]: 


o (a) o2 da ag 
(x, Y, t) = zaJ J Yatt — (a — g)? — (B — y)? i 


where Car is the circle with radius at and centre (x, y). On introducing 
new variables a’ = a — 2 and f’ = $ — y instead of a and 8, we can 
transform the double integral to that over the circle C4, with radius a 
and centre at the origin: 


w (a’ + x) ota da’ ap’ 
wizyt =a f [2 f 


yat. — a^? — a2 pa 
Cat 


or alternatively, after taking e”? outside the integral sign, we can 
write 


w(x, y, t) = 6'2 u(x, t), (40) 
where the factor 
w (a’ + 2) of't da’ ap’ 
al) Yat? — a’? — p” (41) 
Cat 


is clearly independent of y. We show that (41) in fact solves our original 
problem, i.e. satisfies equation (37) and initial conditions (38). The 
function w satisfies equation (39,), and on substituting expression (40) 
in (39,), we obtain equation (37) for u after cancelling out e”'*. The 
initial conditions for u are obtained at once from initial conditions (39,) 
for w and expression (40). Thus (41) gives the solution of equation (37) 
with initial conditions (38). We shall write the expression on the right- 
hand side of (41) in a new form. 
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We reduce the double integral over the circle Cg; to two quadratures: 

u (x, t) = 
tat + yatan oth" ja 


1 
— 2na f f yat —a? — p” 
~at ~ yanai 
We replace the variable of integration £’ of the inner integral by a new 
variable g, in accordance with 8’ = Vat — asin p, which gives us 
the integral in the form: 


ap’| w(a@’-+a)da’. (42) 


+aj2 : 
o Yatta! sin p dy 


—al2 


On introducing a new transcendental function I(z), defined by an 
integral and depending on the parameter 2: 


+n{2 
I(z)= 2 f e% 5in e do, (43) 
—n]2 
we can write (42) in the form: 
+at 
u (x, t) = a f I (2 azt — a?) w (a’ + x) da’, 


or, on introducing as variable of integration a = a’ + x: 


x-+at 
u (x, t) = a f I (= yat — (a — z7) w (a) da. 
x--at 
We differentiate the solution obtained with respect to ¢ and obtain 
as in [171] a new solution u, = du/dt of equation (37), no longer 
satisfying initial conditions (38) but instead the conditions 


yp Sus 2 
u ki = o (2); E I Os (44) 
The solution of (37) satisfying the general initial conditions 


Ls = ọ (x); ea e = 9; (x), (45) 


is simply obtained by taking w(x) = p, (x) in initial conditions (38) and 
a(x) = p(x) in conditions (44) then adding the respective expressions 
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for u; this gives us 


u(x,t) = ii (= Vat — (a — x)? p (a) da + 
x—at 
x+at 
+l J n (e yae =a- z7?) » (a) dal. (46) 


On carrying out the differentiation with respect to t both for the 
upper and lower limits and under the integral sign, and noting that 
I(0) = 1 by (43), we can rewrite (46) in the form: 


E E 


+ T (= Var? = (a — 2?) pı (a) da + 


x—at 


x-+at 
ct 1 phe Nise Sa 
Pa) mea (ee =O eter 
x—at 
where I’(z) denotes the derivative of I(z) with respect to the argument z. 


We now establish a relationship between the function J(z) and the 
zero order Bessel function [48]: 
—1)§ 2s 
‘ar (3) - (48) 


w 
ie He 
s=0 ( 
zsing 


On expanding e in @ power series: 


œ ne n 
zsin g — 2z sin g 
$ z a n! , 
n=0 
then integrating term by term over the interval (—2/2, +7/2), as 
is possible in view of the uniform convergence of the series, we get 


e +n]/2 
I(z)= aa zl, sin” g dø. 


The integrals on the right evidently vanish with n odd, whilst we 
have for even n = 2s [I, 100]: 
+nf2 n[2 


- B dae __ (28 — 1) (28 — 3)...1 
[ee eae 28 . (28 — 2)...2 my 
-7 
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whence it follows that: 
_ 2 (Qe — 1) (28 — 3)... 1 
12) = È Gay "9s. (@Qe—2)...2”’ 


or 
S 1 f2)” 
On comparing this expansion with (48), we see that 
I (2) = Jo (iz). (50) 


186. The general case of an infinite circuit. We shall now discuss 
the equation of telegraphy for an infinite circuit. A preliminary obser- 
vation concerns equation (21), obtained for the auxiliary function w 
in [183]: this is also the equation which must be satisfied individually 
by the voltage v and current 7. 

We return to the fundamental equations (1) and (2) and eliminate 7. 
We do this by differentiating equation (1) with respect to w then re- 
placing 07/0” by its expression from equation (2): 


3w @i gi 
pat tL age TE TO 


ie. 
zv ð Ov Ov 
gr — L (0 a + 40) — R (0 | + Ao) =0 
whence we have equation (21) for v: 
3? 0? ð 
g ~ LC ar — (LA + RC)-5-— RAv=0. (51) 


If we had eliminated the voltage v from equations (1) and (2), we 
should have again arrived at this equation as the equation for i. 

Having determined v, we can find 7 so that it satisfies equations (1) 
and (2). For instance, use of equation (2) gives us: 


i= — (0 Ẹ + 4o) dr + Bi, (52) 


where the integration is with respect to x with constant t and B(t) is 
at present an arbitrary function of t. We set this expression for 7 in 
equation (1) and differentiate with respect to the parameter t under 
the integral sign: 


$e f(r + Aiar- f (re + RAv) dx + 


+ LB’ (t) + RB(t)=0. (53) 
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Differentiation with respect to x of the sum of the first three terms 
gives us zero by (51), i.e. the sum is a known function of ¢ only, and we 
obtain a linear first order equation for B(t). The arbitrary constant 
obtained by integrating this latter equation is in general defined by 
the initial conditions. 

As above in [183], equation (51) reduces to the form 


Ou 1 Ou 


ot = LO ext + OU (54) 
with the aid of the substitution 
v (x, t) = e™t u (x, t), (55) 
where 
pot, ool 


If we are given v and # along the circuit at t = 0, we must also know 
dv/dx and 0i/dx at t= 0, whilst equations (1) and (2) give dv/dt and 
di/dt at t = 0. Hence we can suppose that, along with equation (51), 
we have the general initial conditions: 


Ov 


|, P(x); ar | _ = Y (2). (57) 

We make use of (55) to obtain the following initial conditions for u: 
| du 

u i = D(x); Or ii = u® (x) + Y (x). (58) 


We finally get, on applying equation (47) for u and taking (55) into 
account: 


v (z, t) = 5e [Ð (æ — at) + © (z + at) + 


x+at 
++ f [u® (a) +Y (a)] 1 ($ yer —(a— 2) dat 
x—at 


x-+at 


es f 1 Vv (Ver —(a—2)") D (a) da}, (59) 


Rie Va?i? — (a — a)? 


where u and c are as given above, anda = 1//ZC. 

We have here a definite velocity a of propagation of the excitation, 
as in the case of a vibrating string; thus, if the functions (x) and W(x) 
giving the initial excitation differ from zero only in a finite interval 
p<x<q and we apply (57) at the point x vhere x >q, we must 

have v(z,t) zero up to the instant t= (x— q)ja. The essential 
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difference from the case of a string lies in the fact that v(x, t) 
does not vanish or become constant after passage of the rear 
front of the initial excitation, but is still a function of x and t. In fact, 
if t> (x— p)ja, the terms outside the integral signs in (59) vanish, 
whereas the integrals remain, the interval of integration being the con- 
stant (p, q). Nevertheless, the variables x and ¢ appear as parameters 
under the integral signs. 

If, for instance, current is absent in the circuit at t = 0, whilst the 
potential v is given by D(x), we have by equation (2): 


Ov 


A 
ara —GEP(z). (60) 


If we take A = 0, ie. neglect the leakage, the right-hand side is zero. 


187. Fourier’s method for a finite circuit. Fourier’s method may 
readily be used for integrating equation (51) with given initial and 
boundary conditions in the case of a finite circuit. Let the end x = 0 
be subjected to a given constant voltage E, whilst v = 0 at the other 
end, so that the boundary conditions are 


u |x=0 = E, vha =0. (61) 


Furthermore, let there be neither voltage nor current in the circuit 
at the initial instant ¢ = 0, i.e. 


v limo = 0, i |im0 = 0 (62) 


for O0<a<l. 
Equations (1) and (2) show us that here: 


ai 
# leo = O° (63) 


We thus have to integrate equation (51) with boundary conditions 
(61) and with initial conditions 


ðv 


v |0 = 0; E 


ica (O<#<l). (64) 


We start by constructing a solution of (51), v = F(x), which depends 
only on g and which would satisfy boundary conditions (61). We ob- 
tain as an equation for F(z): 


F” (x) — bF (x) = 0 (b? = RA). 
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We actually found the solution of this equation in the example 
of [182], which also satisfied conditions (61): 


sinh b (l — æ) 
sinh bl ` 


F(x) =E (65) 


We now introduce a new required function w(x, t) in place of v(x, t), 
in accordance with 
w (x,t) = v (x, t) —- F (x). (66) 


We have the same equation (51) for w(x, t), together with the homo- 
geneous boundary conditions 


w |x=0 =0; w [x= =0 (67) 
and the initial conditions 
ðw 
wlio = — F(z); GP ro O: (68) 


We shall simplify the writing by putting equation (51) for w in the 
form: 


Be ge BY oh _ bw =0, (69) 
where 
a?=LC; 2h=LA+RC; b= RA. (70) 


We now proceed as usual with Fourier’s method. We seek a solution 
of (69) as the product of a function of x only and a function of t only: 


w= XT. 
We obtain on substituting in (69) and separating the variables: 
X” _ aT" 4 CI + eT mèn? 
Xo T ~ 


where m? is at present an arbitrary constant. We have two linear 
equations with constant coefficients: 


x4 Ex =0 


aT" + 26T + (B+ PRE) 7 =0. 
We now take the boundary conditions (67) into account and select 


the solutions 
IMAL 


l 


X m= sin (m =1,2,...) 
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of the first equation, where m is a positive integer. The equation for T 
has the general solution 

T n = Am ert + Al, emt, 
where Am and Aj, are arbitrary constants, whilst am and am are the 
roots of the equation 


ala? + ha + (bE +- ma?) = 0, (71) 


the circuit constants R, L, C and A being assumed to be such that this 
equation has different roots for any integral m. We thus obtain an in- 
finite set of solutions satisfying the boundary conditions: 


= (Apm emt + A’, emt) sin 2T (72) 
We take the sum of these solutions: 
w= & (Am emt + A’ emt) gin (73) 


m=1 


and select the constants Amand An so as to satisfy the initial conditions 
(68). This gives us: 


È Ant Ae) sin = — F (x) 
(0<2#<l) 


> (Am Am + am Am) sin 17 
m=1 


Having found the Fourier coefficients in the ordinary way, we get 
two equations for Am and Am: 


ri mE de 
Am F Am [rE sin (74) 


am An + af, Al, = 0. 


Having substituted for F(x) from (65), we can carry out the inte- 
gration and obtain: 


l 


2 . Mng 2mn 
ifr (x) sin—7— dx = a5 F mī E. 
0 
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We have after solving simultaneous equations (74): 


A = 2mm ge tm 
m ~~ b3 +. min? am — am’ Mmo DRF mint am — am ` 
On substituting this in expression (73), we get: 


amt gietat | mag 


_ e 2m7 ame 
arD bz F min? ` am — am pii E (et (75) 


The roots of equation (71) are either real and negative, or complex 
conjugates with negative real parts. In either case, solution (75) is 
damped with increasing ¢. It defines the transient process from the 
empty circuit to the steady state defined by function (65). Equation 
(66) gives us the final expression for the voltage: 


__ yz Sinhb (l—a amem gn etmt 
=% sinh bl LE 2 ae . am an sin—>—. (76) 


Solution of quadratic equation (71) gives us roots of the form 


m= —V + kbm In = — Y — Ekm (77) 
where 
yo, kaa eRe PPL min) (78) 
T a? m ~~ al 7 


On substituting in (76), we can rewrite this as 
sinh b (l— x) 
Me a ob 


— Ee” 5 2m 


2 272 
| ee 


v= 


(cosh km eee — sinh k nt) sin a ; (79) 


We now find ¢ by the method of the previous section. Equation (2) 
gives us: 


a sinh b (l ~ x) 
Ox AE sinhe T 


IRE 2 . , 
+ AEe-* = aaa (cosh kmt + a sinh Int) sin E 4 


MNL 


2 . 
+ CEe—” = BR ae (tm — +) sinh k„t sin Ta 


or, on observing that, by (78): 
2 p Bb mint 


al? 
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we can integrate with respect to x and use the fact that a? = LC: 


_ AE _ cosh b (l — 2) a 


i 


b sinh bl 
E l v MAL 
i SSM) ee = 
2AEle = Bit | wine (cosh kmt + i sinh Int) cos —— + 
+e > =, sinh Int cos 2 +. B (t). (80) 
m=} 


We find an equation for B(t) on substituting in equation (1): 


LB’ (t) + RB (t) = 0, 
whence 
B (t) = Bye FE, (81) 
where B, is an arbitrary constant which has to be determined from 
the condition that iis zero throughout the length of the circuit at t = 0. 
We substitute expression (81) in (80) and then set t = 0 and i= 0, 
which gives us 
AE cosh b (l— x) 


O= a 


b sinh bl 


z 1 
— 24 El Y sa pane 8 T+ By. (82) 
m=1 


But the cosine Fourier expansion of the first term on the right in 
the interval 0 < x < l is 


AE coshb(l— zx) AE = I max 
banh te oS parae Og ORES, 
m=i 
and condition (82) gives: 
AE E 
Bo — -r = R 
so that 
B (t) ae E o-RtL 
RI i 


Substitution of this expression for B(t) in (80) gives us the final 
expression for the current. 

A detailed discussion of the above method of solution can be found 
in A. V. Krylov’s article “The propagation of currents along cables” 
(Zhurnal prikladnoi fiziki, Vol. VI, sec. 2, p. 66, 1929). 


188. The generalized wave equation. We considered the generalized 
wave equation in the linear case in [185], i.e. with two independent 
variables. We can use the same method for the generalized wave 
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equation with three or four independent variables. We shall take the 
velocity a = 1 in the equation, so as to simplify later expressions; 
these can be turned into formulae for any a simply by replacing ¢ 
in them by at. 

We take the equation for the infinite plane: 


92 32 0? 
Gea Get toe 6s) 
with the initial conditions 
ðu 
ap BE hey T2 OY) (84) 


We firstly consider a new problem, that of integrating the wave 
equation 
Fw Ow Zw ow 


we ee tae tae 


with initial conditions 


= = w (x,y) e”. 
it=0 Ot |t=0 (x,y) 


The new problem is solved at once by Poisson’s formula: 
27 n 
w= af fo (x +tsin@cosg, y -+ tsin 0 sin g) ef @*# 8 sin 0 dô dg. 
ð 0 


We can rewrite this in the form: 


w (x, Y, 2, t) =eřu (x, Y, t) ’ 
where 


u (zx, Y, t) = 
2n x 


=- | [o (@+tsinb cosy, y -+ tsin Osin g) e? sin 0 dd dp, (85) 
ò ò 


It can be shown, precisely as in [185], that this latter function satisfies 
equation (83) and initial conditions (84). We now simplify (85) by 
replacing 0 with a new variable of integration ọ in accordance with 
t cos 0 = ọ, from which we have: 


tsin 0 dé = — do, sind = 1- $- 
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The integral over 8 in (85) becomes with the new variable: 
+t 
fof + VÈ — @? cos p, y + VË — esin g) e“ do, 
Lt 


which may be written, if we split the interval of integration into (—t, 0) 
and (0, £) and replace ọ by (— ọ) in the first sub-interval: 


t 
Z fo (w+ Vt — o? cos p, y + Vi? — o? sin p) cosh ce de, 
ò 


so that (85) may be written as 


u(x, y, t) = 
t 2n 
=i [foe VE — e2 cosy, y + Vt? — o? sin p) äp |cosh cede. 
ò 0 


The integration with respect to p in this expression gives the 
arithmetic mean of the function w(x, y) over the circle in the zy plane 
with centre (x,y) and radius //2 — 92. On denoting this arithmetic 
mean as Typrg {o(x, y)}, we can write (85) in the final form 


t 
u (x, y, t) = È T yaza {w (x, y)} cosh ce do. (86) 
ò 


We note that if c = c i is pure imaginary, cosh cg = cos cg. On 
differentiating our solution with respect to ż we get the solution 
u, = du/dt of equation (83) satisfying the initial conditions: 

Ou, 


Uy page y); aE meer 


Similarly, the integration of the equation 


Ou Ou Ou eu 
az Gat + oye t oe + OU (87) 
with the initial conditions 
ðu 
A = i oO = 88 
u ae 0; at lio w (x, Y, 2) (88) 


requires the use of formula (82,) of § 17 [173] with n = 4, œ being 
replaced by (22, %3,%,)e™!. After some simple rearrangements, we 


189] FUNDAMENTAL EQUATIONS 559 


obtain the solution of (87) with initial conditions (88) in the form: 
t 
1 ə ; yeZ 
u= t Ot eI (ic = e?) Te {w (x, Y, z)} de , 
0 


where T, {«(x, y, 2)} is as usual the mean of w(z, y, z) over the sphere 
of radius pọ and centre at (x, y, 2). 


§ 19. The vibrations of rods 


189, Fundamental equations. Fourier’s method is used for a number 
of problems of mathematical physics which lead to partial differential 
equations. We arrive at expansions of a given function into the 
functions originating from application of the method. We have had 
examples of such expansions in the series of problems worked out above. 

A further example is presented by the transverse vibrations of a rod, 
the equations for which we shall now deduce. 

A thin rod differs from a string in that it does work on bending. 
The required function is the ordinate y(x, t) of the axis of deformation 
of the rod with abscissa x and at the instant t. 

If M is the bending moment and F(z, t) the loading per unit length, 
we know [16] from the theory of flexure that 

az aM 
Elg =M; yr =F, (1) 
so that we find on differentiating the first equation twice with respect 
to g: 


BI SY = F (x,t). (2) 


Equation (2) would express the condition of equilibrium of the rod 
if the force F were independent of time and the rod remained at rest. 
To obtain the equation of motion, we have to include the inertia force 
per unit length along with the external force, in accordance with 
d’Alembert’s principle. The acceleration of a section x can be reckoned 
constant at all points of the section and equal to d?y/dd?, the inertia 
force being evidently found by multiplying the acceleration by 98, where 
g is the volumetric density of the material of the rod and S is the cross- 
sectional area of the section. We must thus replace F by F — o8 3?y/3t? 
in equation (2), which gives us the fourth order equation: 

Oty 


Se 4 oe AY = fw, t), (3) 
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where 
5 EI 1 
b = Fe. I t) = og F(z, t). (4) 


Great importance is attached to the boundary conditions which must 
be satisfied at the ends x = 0 and z= of the rod, the form of these 
being dependent on how the respective ends are fixed. If an end is 
fixed rigidly, so that the rod has a horizontal direction at that end, 
we get the two conditions: 


y=0, #0 for x=0 or w=l. (5) 


If an end is merely supported, i.e. free turning is possible about the 
fixing point, the bending moment must be zero at this point, i.e. we 
have the conditions 


By 
> ðr? 


y=0 =0 for x=0 or x=l. (6) 

Finally, if an end is free, both the bending moment and the shearing 
force 01/dx must be zero at that end, whereas y itself may now 
differ from zero. Hence in this case 


== 0, SY 0 for x=0 or r=l. (7) 
In all these cases, we get a pair of conditions for each end, contrary 
to the string, where we had one condition per end. 
We also have to bring in initial conditions, of the same type as we 
had with a string: 


z vj 
y t=0 =P (x) ? Ot t=0 =f (x). (8) 
As regards the free vibrations, we put /(z, ¢) = 0 in equation (3), 


which gives 


Oy Oty 
oe t ae = 9. (9) 


190, Particular solutions. As in the case of a string, we seek a 
particular solution of this equation in the form 


y =T (t) X (x). (10) 
We find by substituting in (9) that 
T” (t) X (x) + WT (t) X (x) =0, 
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or, a8 for a string: 


re) ZOA is 
TA  X@ o? 
where k4 is constant, k being assumed real. 
This gives us: 
T” (t) + bk T (t) =0, (11) 
XA (x) — kt X (x) = 0. (12) 


The general solution of equation (11) is 
T (t) = N sin (bkt + oy), (13) 


i.e. solution (10) is again a standing wave, such that points of the rod 
perform harmonic vibrations of the same frequency and phase but of 
different amplitudes NX(x) which depend on z. 

The general solution of equation (12) can also readily be found. 
Its characteristic equation [30] is 


at — kt =0 
with roots k, —k, ik, —ik for k # 0, so that its general solution is 
X (x) = O,e™ + C,e™ + C3 cos kx + Cysin ka , (14) 


which may be written alternatively, on expressing e and e~™ in terms 
of cosh kz and sinh kz and changing the arbitrary constants C, and C,: 
X (x) = C, cosh ka + O, sinh kx + C; cos kx + O, sin ka. (15) 
We now distinguish the various cases of boundary conditions. 
1. If the rod is supported at both ends, we must satisfy conditions 
(6) at x = 0 and v = l, i.e. 


X(0)=0,+0C3;=0; X"(0)=k?(C, —C,) =0 
X(l) = C, cosh kl + C, sinh kl + Cy cos kl + C,sin kl = 0 
X” (l) = k? (C, cosh kl + C, sinh kl — C3 cos kl — Cysin kl) -=0, 
and these obviously give with k # 0: 
C,=C;=0; (16) 
C, sinh kl + C,sinkl=0; C,sinhkl—C,sinkl=0.  (16,) 


The last system of simultaneous equations has the obvious solution 
C, = C, = 0, in which case all the constants C are zero and we get 
the trivial solution X(x) = 0. We neglect this case and assume that 
at least one of constants C., C, differs from zero. 
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If C, = 0, it follows from (16,) that C, = 0 since sinh kl ¥ 0 for 
k # 0 [1,177]. We must therefore take O, # 0. We get C,sin kl = 0 
on subtracting the two equations (16,), so that we finally have the 
equation for k: 


sin kl = 0. 
If this condition is satisfied, equations (16,) now reduce to C, sinh kl 
= 0, so that C, = 0; thus we obtain from (14), on writing C, = C: 


X (x) = Csin kg. (17) 
Equation (17) gives the same values for k: 


2a na 


n 
Tr Tec Test? 


as in the case of a string, and the subsequent arguments and formulae 
are the same as in [167], the only difference being that the frequency 
wn of the nth harmonic is given by 


bnn? 
On = a (18) 
instead of by expression (44) [168]. 

With k = 0, equation (12) has the general solution: X(æ) = 0, + 
+ C, 2 + C, 2? + 0,23, and we find on attempting to satisfy condi- 
tions (6) that all the constants C must vanish. 

2. If the rod is rigidly constrained at both ends, we must satisfy 
conditions (5) for æ = 0 and v = l, which gives: 

X(0)=0C,+C,=0; X’(0)=k(C,+C,) =0 
X(l) = C, cosh kl -+ O, sinh kl + C, cos kl + C, sin kl = 0 
X’(l) = k (C, sinh kl + C, cosh kl — O; sin kl +- C, cos kl) =0. 
Hence it is clear that 
C3=— 0; 0=-—0,, (19) 
and we get the system of simultaneous equations for C, and C,: 
C, (cosh kl — cos kl) + C, (sinh kl — sin kl) = 0 (20) 
C, (sinh kl + sin kl) + C, (cosh kl — cos kl) = 0. 


The necessary and sufficient condition for this system to have a 
solution differing from C, = C, = 0 is that the coefficients of C,, C, 


are proportional: 
cosh kl —coskl _ sinh kl — sin kl 
sinh kl + sinkl ~ cosh kl — cos kl 
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In this case, the two equations (20) reduce to one, which can be 
written, on using the relationships: 


cos? g + sin? x = 1, cosh?2 —sinh?2=1, 


in the form: 
cosh kl- cos kl = 1. (21) 


We have obtained an equation for k analogous to (17) in the previous 
case. On writing 
kl=1, 


for brevity, we have a transcendental equation for A: 


cosh A cos À = 1. (22) 


Fre. 140 
On writing (22) as 
1 
cos Ase cosh A 


and drawing the graphs of the left and right-hand sides (Fig. 140), 
we find that (22) has an infinite set of real roots: 


ey Ae De oer SA 


where the difference 


2n +1 

An — mileo as g>. 
We shall only pay attention to the positive roots for the present: 
E ER EE (23) 


These correspond to an infinity of values of parameter k: 


A E E A 


An 
TT sae (24) 
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Condition (21) is satisfied for these values of k, one of equations (20) 
is implied by the other, and we can put: 


C, = C (sinh kl — sin kl); C, = — C (cosh kl — cos kl). 


On writing for constants C;, C, their values given by (19), substitut- 
ing in (15) and setting C = 1, which we can obviously can do without 
loss of generality, we obtain the solution X(z) as: 


X (x) = (sinh kl — sin kl) (cosh ka — cos ka) 
— (cosh kl — cos kl) (sinh kx — sin kz). (25) 


Strictly speaking, we get the infinite set of solutions, 


X, (x), X_(x),..., Xal) (26) 

found by substituting k, for k in general formula (25). 
We cannot make use of the negative roots, —A,, —A,, ... since they 
correspond with the values —k,, —k,, ... of the parameter, which 


yield the same sequence of functions (26) in view of the oddness of 
functions (25) with respect to k. 

On replacing k by its values (24) in (13), we find the corresponding 
sequence of functions T(t): 


T(t), T,(t),..., Tat), ...3 T(t) = N nsin (Wat + Pn); Op = OKR, (27) 
and finally, the sequence of solutions of equation (9): 
Ya (E, t), Yo (2t) oo 5 Yn (8t), -< -3 Yn (8t) = Ty (t) Xn (£). (28) 


We obtain exactly analogous results for all the remaining conditions 
for the ends of the rod: having expressed the function X(x) in the form 
(15) and substituted it in the boundary conditions, we get a system 
of four simultaneous equations with four unknowns C,, Cz, C3, Cy, 
which have a non-zero solution when and only when the parameter k 
satisfies a certain transcendental equation having an infinity of real 
roots. Substitution of a root k of this equation in the coefficients 
of the system gives a system in which one equation is a consequence 
of the other three, and constants C,, Cz, C3, C, are completely determin- 
ed except for an arbitrary common factor: thus we obtain the functions 
Xn (x) as linear combinations of ordinary and hyperbolic sines and 
cosines. 
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191. The expansion of an arbitrary function. We shall not stop to 
work out all the particular cases in detail but shall consider how to 
satisfy the initial conditions: 

by 


y bos =g (x), Ot 


We use the same method as in the case of a string and seek y(x, t) 
as a sum of particular solutions (28): 


= P: (2). (29) 


t= 


y (2,0) = Syn (at) = ST (t) Xn (2). (30) 
n=1 n=l 


On setting: 
a,=N,sing,, On = N n CO8 Ors 
(30) becomes: 
y (x,t) = X (a, cos wnt + bn sin nt) X, (x), (30,) 


n=l 


and conditions (29) give: 
D> an Xn (x)= p (2); X On On Xn (x) = p, (2). (31) 
n=! n=1 


We see from this that the problem of finding the coefficients an, bn 
reduces to that of expanding the given functions g(x), y,(x) into series 
in the functions X,(x). These latter series are analogous to the Fourier 
series considered above. 

We shall follow the same method as for the Fourier series in [142] 
and merely show how to find the coefficients of these expansions; 
we assume that the expansions are possible and omit any discussion 
of their convergence or divergence. We also assume here that the 
boundary conditions of the problem are not necessarily those enume- 
rated in items 1 and 2 of [190], but are any of (5), (6) and (7) above. 

Let f(x) be a function given in the interval (0, l) and let it be 
required to expand this in the form: 


f(x) = > An Xn (x). (32) 
n=! 


We assume that this expansion is possible and that series (32) can 
be integrated term by term. The coefficients A, can be found by 
virtue of the orthogonality of the functions 


X, (a), X,(x),..., Xp (2)... 
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in the interval (0,2); for we shall now show that 
I 
| X (2) X_(a2)da=0, if ngm. (33) 
6 
We do this by first noting that the X,(x) satisfy equation (12) by 


definition, if the k there is replaced by kp, i.e. 


XP (x) = kå X, (2). 
We now have: 


XM (x) = kh Xa (2); XP (2) = kn Xm (2). (34) 


On multiplying the first equation by Xm(x) and the second by 
X,(z), then subtracting and integrating from 0 to l with respect 
to x, we get: 


l 
(k$, — kaS Xn (2) Xn (2) da = 


I 
= J EE @ Xn (2) — XP (2) Xm (2)] de, (35) 
After this, we only have to show, in order to prove (33), that 
I 
f [XY (x) Xn (£) — XP (w) Xm (2)] de = 0, (36) 


since the factor (k$, — k4) does not vanish for m ¥ n. 
Integration by parts gives us: 
S XQ (x) Xp (2) de = X (2) Xn (2) — f Xm (2) Xn (2) de = 
= Xin (2) Xn (x) — Xm (£) Xp (x) + f Xm (2) X} (2) dz, 
and similarly: 
S XP (2) Xm (£) da = Ky’ (x) Xm (2) — Xa Xm (x) + f Xn (2) Xh (x) de, 
whence we readily deduce that 


l 
i (XY (a) Xn (x) — XW (x) Xm (x)] da = 
= [Xin (x) Xn (£) — X” (x) Xm (2)] 35 — 


— [Xa (2) Xp (2) — Xn (2) Xm (2)] [š 


The right-hand side of this last equation contains the values at 
x = 0 and g =l of X,,(x), Xn(x) and their derivatives up to and 
including the third order, and whichever of conditions (5), (6) or (7) 
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we take, a vanishing factor is found in each term of the right-hand 
side. Thus equation (36) is proved, and with this the orthogonality 
expressed by (33). 

With m = n, integral (33) becomes 


l 
I = J X? (x) da (37) 


and is a well-defined constant which can readily be found in each 
particular case. We have, for instance, in case 1 [190]: 


1 
I, = | sin? — dz = 
Ò 


i 
z 


ít follows that if we change the system of functions X,(x), X (x), 
..+, &,(x%) into the system 


Xie) Xa) Xa) 
YI, $ Yi, p] , VI, , , 


we obtain a normalized as well as orthogonal system [148], i.e. the 
integral of the square of each function is unity. We return to the 
determination of the coefficients A, of expansion (32). On multiplying 
both sides by X,,(x), integrating with respect to x from 0 to l, and 
taking account of relationships (33) and (37), we find at once that 


l 
JEE) Xm (2) de = Am Im 
whence 


l l 
j Í (x) Xm (x) de fro Xm (x) dx 


m Im 


A l 
j X2, (x) dz 


We thus arrive at an expansion resembling a Fourier series, of an 
arbitrary function f(x): 
l 
e f1 @ Xn (2) de 
f(z) = = 4,X,(2), where A= y~. (38) 
l = 


i X2 (x) dx 
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After what has been said, the determination of the constants an 
and bn in equations (31) presents no difficulty at all; we have, in 
fact, on replacing f(x) by g(x) and p(x) in (88): 


l I 
i P (2) Xp (x) dz i pı (2) Xq (a) dx 
a a a 


a, = (39) 


i , T 
i XA (x) dx On j X} (a) da 

Substitution of all these in series (30) gives us the final solution 
of the problem. 

The forced vibrations of a rod are treated precisely as in the case 
of a string, except that the function f(x, ¢) is now expanded in functions 
X,(x) instead of in sines. 

It is clear from the above that the standing wave method is applicable 
with equal success to the vibrations both of strings and of rods. 
Whilst the method of characteristics is very useful in studying the 
vibrations of strings and the equation of telegraphy, it has not yet 
been applied with real success to equation (9). 


§ 20. Laplace’s equation 


192. Harmonic functions. We consider in the present article partial 

differential equations of the form 

U , PU , #U 

“Sat t ay T a = O (1) 
where U is a function of z, y, z. As already mentioned, (1) is known as 
Laplace’s equation. We have also seen above that the left-hand 
side is written symbolically as AU, this being described as Laplace’s 
operator on the function U. We saw in [87] that equation (1) must 
be satisfied by the potential of a gravitational force or of the inter- 
action between electric charges at all points of space outside the 
attracting bodies or the charges which produce the field. 

An equation of type (1) was also encountered in [114], where it 
was satisfied by the velocity potential of the irrotational flow of an 
incompressible fluid. We proved in [117] that (1) must likewise be 
satisfied by the temperature in a homogeneous body if the heat 
exchange is stationary, i.e. the temperature U depends only on the 
position of the point and not on time. Similarly, our investigation 
of [118] of a stationary electromagnetic field led to Laplace’s equation. 


192] HARMONIC FUNCTIONS 569 


If U is independent of one cvordinate, say z, equation (1) reduces to 
FU aU 
Oat F Oy? = 0. (2) 

In this case, U has the same value on any line parallel to the z 
axis, in other words, the values of U show the same variational picture 
on any plane parallel to the zy plane, so that only the latter plane 
need be considered. 

A function which is continuous, together with its derivatives up 
to the second order, in a volume (three-dimensional domain) (D) 
where it satisfies equation (1) is said to be harmonic in (D). The 
same term is applied in regard to equation (2) for a domain in the 
xy plane. We elucidate below some properties of harmonic functions. 

A function U, in addition to satisfying equation (1), has usually 
to be subjected to certain boundary conditions in problems of mathe- 
matical physics. Initial conditions are naturally absent in the present 
case. Boundary value problems for equation (1) amount fundamentally 
to the following: to find a function which is harmonic in a domain (D) 
and the values of which are assigned on a surface (S) of the domain. 
This is generally known as Dirichlet’s problem. When we speak of the 
values of U “on a surface (S) of the domain” we understand the 
limiting values attained by U on approaching points of (S) from 
inside (D). The problem may be formulated more precisely as: to 
seek a function U which is harmonic inside (D) and is continuous 
over (D) including its boundary (S), the values of U being specified 
on (8). The function specified on (S) must naturally be continuous. 
We shall assume for simplicity that the boundary of (D) is a single 
closed surface (S). It may be noted that (D) can be finite or infinite. 
In the latter case, it lies outside (S). With a finite domain, we have 
an interior Dirichlet problem, and with an infinite domain, an exterior 
Dirichlet problem. The latter problem requires the further condition 
that the function tends to zero at an infinite distance, or as we 
usually say, vanishes at infinity. The boundary condition for Dirichlet’s 
problem is written as 

U \s) =f (M), (3) 
where f(M) is a continuous function given on the surface (8) and M 
is a variable point of (S). The interior Dirichlet problem is similarly 
stated in regard to equation (2) for a plane domain, the boundary 
condition consisting of a specification of U on the contour of the. 
domain. The exterior Dirichlet problem on a plane requires the 
function to have a finite limit at an infinite distance. 
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We shall mention one other type of boundary condition, when the 
normal derivative is assigned on the surface (S): 


ðU 
On ko =f). (4) 
The task of finding a harmonic function satisfying this type of 
boundary condition is known as Neumann’s problem. It is met with 
in hydrodynamics when considering the motion of a rigid body in 
an ideal incompressible fluid. Boundary condition (4) here expresses 
the equality of the normal components of velocity of a point M of 
surface (S) of the body and of the fluid particle adjacent to M. 
Neumann’s problem can likewise be stated for equation (2). 
We shall derive some necessary formulae before passing on to 
elucidate the properties of harmonic functions. 


193. Green’s formula. Let (D) be a finite domain bounded by a 
surface (8), and let U, V be two functions which are continuous and 
have continuous derivatives up to the second order inside (D) up to 
its boundary (8). We consider the integral: 


I =J [J Gee + ne T oS} de = J | ares Ogee V dv. 
On using the obvious identity: 


U OV 


Oa? 


Əs ðr ðs 


oc) 


au av. ə (v a 


and the corresponding identities for 0/oy and 0/0z, we can write the 
integral as 


A EE (0H) +E A 


We transform the first term on the right by using Ostrogradskii’s 
formula: 


r={ [U Zcos (n, X) + U $Z cos (n, Y) + 
(S) 
+U ÎE cos (n, zjjas— f [ [UAV de 
(D) 


or [102]: 
r=f fu as — ff [vard, 
(S) (D) 
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where (n) is the outward normal in respect to (D) at points of the 
surface (8). 
We thus arrive at what is known as Green’s preliminary formula: 


[E+ E+ ea 
= | Jeeta area ove ds — hearer (5) 


The left-hand side of this equation remains the same on interchang- 
ing U and V, so that the same must be true for the right-hand side, 
i.e. we can write: 


ILES ds — p varies] [P SU dg fI PATa 


whence Green’s formula is obtained in its final form: 
f[ {wav —vav) do = feg -rs jas. (6) 
(D) (S) 


We sometimes use the inward instead of the outward normal, in which 
case we only have to change the signs of the derivatives with respect 
to the normal on the right-hand side; thus Green’s formula reads 
with the inward normal: 


Leave papies (Fag e a) as, (6,) 


where n; is the normal direction into (D). 

The domain (D) can be bounded by a number of surfaces (8). 
Green’s formula is applicable in this case except that the surface 
integral on the right has to be taken over all the surfaces bound- 
ing (D). It should be noted that the outward normal (n) from (D) 
is now directed into the surfaces that bound (D) from the inside [63]. 

As we have remarked, it is sufficient to require the continuity of 
functions U, V and of their derivatives up to the second order as 
far as the surface (S) when deducing Green’s formula (6). Certain 
demands must naturally be imposed on (8). We may fall back here 
on the conditions for which Ostrogradskii’s formula was deduced [63]. 
These conditions amounted to the following: the surface (S) can be 
split up into a finite number of pieces such that there is a continuously 


\ 
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varying tangent plane on each piece as far as its boundary. Such a 
surface is said to be piecewise smooth. The boundaries of the pieces 
of surface in question must be piecewise smooth lines. This condition 
imposed on the surface can likewise be 
expressed analytically. 

An important practical corollary of 
Green’s formula expresses the value of 
the function at any given point M inside 
(D) as the sum of a surface and a volume 
integral. Let the function U (M) be defined 
in the domain (D) and be continuous 
along with its derivatives up to the 
second order as far as (8). 

We apply Green’s formula to this function and to the function 
V = l/r, where r is the distance from a given point M, inside (D) 
to a variable point M. The function V = 1/r becomes infinite if 
the point M coincides with M,, and we cannot apply Green’s formula 
to the whole of (D). We isolate M, with a small sphere of centre 
M, and small radius ọ, and write (2,) for the surface of this sphere 
and (D,) for what is left of (D) after removing the sphere (Fig. 141). 
The functions U and V = I/r have the required continuity in (D,) 
and we can apply Green’s formula to this domain, which gives us: 


MESO -1 AU]dv = 


a(-) 1 a0 | {I a(-) 1 av | 
-Jle ath _ 1 lag |) ly te 1 Was, (7) 


(Eo) 

the integration being carried out over both the surfaces (S) and 
(,) bounding (D,). But, as we have seen, V = 1/r satisfies Laplace's 
equation, i.e. A(1/r) = 0 [119]. Furthermore, the normal n is in 
precisely the opposite direction to the radius r on the sphere (Z), 
so that the normal derivative in the integration over (2,) is taken 
with the opposite sign to the derivative with respect to r. On taking 
all this into account, we can write (7) in the form: 


U Mle eC) -1a las4 


[f2 vas- ffi Zas =o. (8) 


(Eo) (Zo) 


Fie. 141 
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We now let the radius pọ of the sphere tend to zero. In this case, 
the first term in the last equation tends to the volume integral over 
the whole of (D) [86]. The second term is independent of 9. We show 
that the third term tends to the limit 4x U(M,). Using the fact that 
r has the constant value g on (2,), we can write: 


ie l U (M)aS = ce Uae 


(Zo) 


The mean value theorem gives us: 


[J ae U (Uf) ds = U (M,)-4ng? = 4U (M,), 


(Zo) 


where M, is a point on the surface of (2,). This point tends to Me 
as ọ—> 0, whence it is clear that the above expression tends to 
4x U(M,). Similarly, application of the mean value theorem to the 
last term gives us: 


-ffa 0U 2U ag = E [eras = -45 Peda — 2U My e 


The first order derivative of U with respect to any direction remains 
bounded as M, tends to M,, since by hypothesis U has continuous 
derivatives up to the second order everywhere inside (D). The factor 
4no tends to zero as pọ— 0. Hence it is clear that the last term in 
equation (8) tends to zero. The limit of equation (8) finally gives the 
required corollary of Green’s formula: 


=) 
f [fac +} flo da -aa las + 4xU (M,) = 
(D) (S) 


or 


U (M) = aleg- Daata (9) 


We notice once more that this formula is valid for any function U 
which is continuous and has continuous derivatives to the second 
order in the domain (D) as far as (8). 

Similar formulae are applicable in the case of a plane. We shall 
state these without dwelling on their proof. Let (B) be a plane domain 
with contour (l), and let n be the outward normal with respect to (B) 


574 THE PARTIAL DIFFERENTIAL EQUATIONS OF MATHEMATICAL PHYSICS [194 


to the contour. In the case of a plane, Laplace’s operator takes the 
form in Cartesian coordinates: 


ZU aU 


AU = Oat + oy’ 


The formula for a plane corresponding to (6) runs: 


ei) a (Up -r Phas. (10) 


There is no complete analogy in regard to expression (9) since it is 
essential for (9) that the function 1/r satisfies Laplace’s equation. 
This is not true on a plane, and instead of 1/r we have to take a 
solution of Laplace’s equation of the form log r or log (1/r) = —log r, 
where 7 is the distance from any fixed point of the plane to a variable 
point M. Thus instead of (9), we have the formula on a plane: 


1 0 (1 0U 1l 
U (Me) =| [u Se" -ierat ae | AU-logr ds, (11) 
) 


where M ¿is any fixed point inside ( B) and r is the distance of a variable 
point M from Me. 

It may be noticed that the triple integral in expression (9) is 
improper, since the integrand becomes infinite at the point M,. The 
integral is obviously convergent, however, since the absolute value 
of the integrand is less that A/r? with p = 1. A similar remark applies 
in the case of expression (11). 


194, The fundamental properties of harmonic functions. We take a 
function U, harmonic in a bounded domain (D) with surface (8). 
On assuming that U and its derivatives to the second order are 
continuous as far as (8), we can apply Green’s formula (6) to U and 
the harmonic function V =1, and obtain, since AV = A(1) = 0 
and 3(1)/3 n = 0: 


{{as=o, (12) 
(S) 


which gives us the first property of harmonic functions: the integral 
of the normal derivative of a harmonic function over the surface of the 
domain is zero. 
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If we apply expression (9) to the harmonic function U, we obtain, 


since AU = 0: 
i ðU sila 
Uh) =a Pa vie or ds. (13) 


Hence we have a second ane of harmonic functions: the value 
of a harmonic function at any interior point of the domain is expressed 
by equation (18) in terms of the values of the function and its normal 
derivative on the surface of the domain. 

We remark that the integrals in (12) and (13) do not contain second 
order derivatives of U. In order to apply these formulae, it is sufficient 
to assume continuity of the harmonic function and its first order 
derivate as far as (S). This may be seen simply by applying a slight 
contraction to the surface (3) and writing (12) and (13) for the con- 
tracted domain (D’) in which there is continuity of the second order 
derivatives as far as the boundary surface; we then pass to the limit 
on expanding (D’) back to (D). The contraction can be accomplished, 
say, by adding the same small length 6 to the inward normal at each 
point of (S). The ends of the added lengths form the new (contracted) 
surface. The surface (S) must be such that, for all sufficiently small 6, 
the operation described leads to a surface which does not cut itself 
and which is piecewise smooth [193]. This matter is treated in 
more detail in Volume IV. 

We shall apply (13) to a particular type of domain — a sphere 
of radius R and centre M,; we assume, of course, that the function U 
is harmonic in the sphere and that both U and its first order derivatives 
are continuous as far as its surface (Xp). 

In this case, the outward normal n has the same direction as the 
radius of the sphere, so that we have: 


fo 


On or re? 
and equation (13) gives: 


U (M,) =a [IF oe vt 7 U)as. 


But r has the constant aia j% on the surface Xr of the sphere, 


so that: 
U (Mo) = qr | f as +- SS Uas, 
(£R) (Zz) 
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or, by (12). we can finally write 


sf U de 


U (Mo) = S (14) 


This formula expresses a third property of harmonic functions: 
the value of a harmonic function at the centre of a sphere is equal to the 
arithmetic mean of its values on the surface of the sphere, i.e. is equal 
to its integral over the surface divided by the surface area. 

This leads us almost at once to a fourth property of harmonic 
functions: 

A function which is continuous as far as the boundary of a domain 
and is harmonic in the interior attains its greatest and least value only 
on the boundary, unless the function is a constant. We give a detailed 
proof of this proposition. Let U (M) attain its greatest value at some 
interior point M, of the domain D where U(M) is harmonic. We draw 
a sphere 4, with radius g and centre M, so that it belongs to D, 
then we apply (14) and change the integrand U to its greatest value 
UM on £, We thus get 


U (M,) < US, 


the sign of equality only being obtained in the case when U is con- 
stant on 2, and equal to U(M,). Inasmuch as U(M,) is the greatest 
value of U(M) in D by hypothesis, we must in fact obtain the sign 
of equality, so that we can say that U(J/) is constant inside and on 
the surface of every sphere with centre M, belonging to D. We show 
that it follows that U(M) is constant throughout D. 

Let N be any interior point of D. We have to show that U(N) = 
= U(M,). We join M, and N by a line of finite length, say by a step- 
line, which lies inside D, and we let d be the shortest distance from 
this line J to the boundary S of D (d is a positive number). By what 
has been proved above, U(M) is equal to the constant U(M,) on a 
sphere with centre M, and radius d. Let M, be the last point of 
intersection, reckoning from M, of the line Z with the surface of this 
sphere. We have U(M,) = U(M,), and by what was proved above, 
U(M) is also equal to the constant U(M,) on the sphere with centre 
M, and radius d. Let M, be the last point of intersection of J with 
this sphere. As above, the function U(M) is equal to the constant 
U(M,) on the sphere with centre M, and radius d, and so on. By con- 
structing a finite number of such spheres, we can verify that U(N) = 
= U(M,), which is what we had to prove. It can also be shown that 
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U(M) can have neither maxima nor minima inside D. By using the 
demonstrated properties of harmonic functions, it is very easily 
shown that the interior Dirichlet problem, mentioned in [185], can 
have only one solution. In fact, if we suppose that two solutions U,(M) 
and U,(M) exist, harmonic inside D and having the same boundary 
values {(M) on the boundary S of D, the difference V (M) = U,(M) — 
— U,(M) will also satisfy Laplace’s equation inside D, so that this 
is also a harmonic function, with a boundary value of zero everywhere 
on S. Hence it follows directly from what has been proved above 
that V(M) is identically zero throughout D, since otherwise it would 
have to attain a positive greatest, value or negative least value inside 
D, which is impossible. Two solutions U,(M) and U,(M) of Dirichlet’s 
problem must thus coincide throughout D. The uniqueness of the 
solution of the exterior Dirichlet problem is similarly proved, on 
taking into account the vanishing by hypothesis of harmonic functions 
at infinity. 

Precisely analogous properties are obtained for harmonic functions 
on a plane. We have here, instead of (13): 


1 l 0U 
U (Me) = 35 | (U ee ogr gr) ds, (15) 


whilst the mean value theorem takes the form: 


U (Ma) = z |U ds, (16) 
2R 


where dp is the circle of radius R and centre M,. For the exterior 
Dirichlet problem, we require only the existence of a finite limit at 
infinity, and not vanishing, as in the three-dimensional case, whilst 
the proof of uniqueness is different from that given above. The 
required proof will be found in Volume IV, where we consider the 
Dirichlet and Neumann problems in more detail. 
We now notice that any constant is a harmonic function satisfying 

the boundary condition: 

ELA 2 

On |e , 
whence it follows that the addition of an arbitrary constant does not 
affect the solution of Neumann’s problem with given boundary 
values of 0U/dn, i.e. the solution is defined to within an arbitrary 
constant. It also follows from equation (12) that the function f( M) 


578 THE PARTIAL DIFFERENTIAL EQUATIONS OF MATHEMATICAL PHYSICS {195 


appearing in the boundary condition of the interior Dirichlet problem 
cannot be arbitrary, but must satisfy the condition: 


SffUhds =o. 
S 


It may further be [noted in conclusion that expression (13) still 
holds in the case when U(M) is a harmonic function in an infinite 
domain, consisting of the part of space outside the surface S. Here, 
we only have to make a hypothesis about the order of smallness of 
U(M) at infinity, ie. when the point M becomes infinitely remote 
It is sufficient (and necessary) to assume the validity at infinity 
of the inequalities 


R|U (M)|< 4; RAT <A, (*) 


where R is the distance of M from the origin or any other fixed point 
of space, A is a numerical constant, and l is an arbitrary direction 
in space. To prove relationship (13) for an infinite domain with the 
conditions indicated, we simply apply it for the finite domain bounded 
by the surface S and by a sphere of sufficiently large radius with centre 
at say M,. As the radius tends to infinity, the integral over the 
spherical surface tends to zero in view of the conditions stated above, 
and we obtain (13) for any point M, lying outside S. As we shall see 
in Volume IV, conditions (*) are certainly satisfied if U(M) tends 
to zero as M moves to infinity. 


195. The solution of Dirichlet’s problem for a circle. We saw in the 
previous section that the solution of Dirichlet’s problem is unique. 
However, we do not yet know if a solution in general exists. We shall 
confine ourselves to particular cases and shall not consider existence 
in the general case. Various methods of solution will be used, and we 
start with the plane case. 

Let it be required to find a function, harmonic inside a circle and 
taking previously assigned values on the circumference. Let R be 
the radius of the circle, and let the centre be taken as coordinate 
origin. The boundary values given on the circumference will now 
represent some known continuous function /(9) of the polar angle on 
the circumference. We take a variable point M with polar coordinates 
(r, 0) inside the circle. The required function must satisfy Laplace’s 
equation [119]: 

0 3U 1 U 
a = 9 
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or 
» U eu 
or? Fr S 062 =0. (17) 


We apply Fourier’s method here and seek the solution of (17) as. 
the product of a function of @ only and a function of r only: 


U = x (0): œ (r). (18} 
We substitute this expression in (17): 
2w” (r) x (8) + r œ (r) x (8) + x” (8) œ (7) = 0 


(0) ra (r) tro (r) 
z0 o) i (18;) 


or 


The left-hand side of this last equation contains only the inde- 
pendent variable 6, and the right-hand side contains only r, so that 
both sides must be equal to the same constant which we can write 
as (—k?). We thus get two equations: 


x” (0) + key (8) =0 and rw” (r) + rw’ (r) — kow (r) = 0. 
The first of these gives, for k ¥ 0: 
x (0) = A cos kð + B sin kð. 


The second is Euler’s equation [92]. We seek its solution as w(r) == 


rm; 


r2.m(m — 1)r™-2 + rmr™—-1 — krr = 0, 
whence we obtain on cancelling r™ : m? — k = 0, ie. m= +k, so 
that the general solution becomes: 
w (r) = Crk + Dr-*, 


provided the constant k differs from zero. Substitution in expression 
(18) gives us for U: 


= (A cos kð + B sin k0) (Cr* + Dr-*). (19): 


With k = 0, we have the equations: 
x” (0) = 0 and rw” (r) + w (r) = 0, 


from which we readily find that 
= (A + B9) (C + Dlogr). (19,) 


580 THE PARTIAL DIFFERENTIAL EQUATIONS OF MATHEMATICAL PHYSICS [195 


We now proceed to determine the constants A, B,C, D and k appear- 
ing in expressions (19) and (19,). We notice that the addition of 2% 
to the angle 0 is equivalent to a rotation about the origin, in the 
course of which the single-valued function U(r, 0) must return to its 
initial value. Thus the first factor of (19), depending on 6, must be 
periodic in 0 and of period 2x. It follows that the constant k can only 
take integral values, k = +1, +2, +3,...,-+n,... 

But if we substitute k = n or k = —n in (19), the result obtained 
must be essentially the same in view of the arbitrariness of the 
coefficient B, so that we can confine ourselves to positive integral 
values of k (the characteristic numbers of the problem), i.e. k =n 
(n = 1, 2,3, ...). 

The periodicity of solution (19,) requires the vanishing of the 
constant B. We thus arrive at the following solutions: 


U,, (7, 0) = (An cos n6 + B, sin n6) (Cnr + D,r—") (n= 1, 2,...) 
Uy (r, 0) = Ay (Co + Dy log T), 
where the constants have been provided with subscripts since they 
can be different for different values of n. Turning now to the second 
factor, that depends on r, we notice that the solution has to be finite 
and continuous at the centre of the circle, i.e. for r = 0. This implies 
that the constant D, and all the D, have to be put equal to zero. 


On writing A, for the arbitrary constants A,C,, and similarly Bn 
for B,C, and A,/2 for A Cg, we can express the solutions as 


U,,(r, 6) = (A, cos nO + B, sin n8)r" (n=1, 2,...): 
A 
U, (r, 0) = is . 
Since Laplace’s equation is homogeneous and linear, the sum of 
these solutions will also be a solution, i.e. we arrive at the solution as: 


U(r, 0) = fe + > (An cos 6 + B, sin n6)r". (20) 
n=l 


We now determine the arbitrary constants A, and B, in accordance 
with the given boundary condition: 


U (r, 0) |,-r = f (8), (21) 


where /(@) is a continuous function given in the interval (—2 < 0 < x) 


and f(—zx) = f(z). 
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This condition gives 
f0) = Aa + > (A, cos nô + B, sin nb) R". (22) 
n=! 


It is clear from this that An R” and B„R” are the Fourier coefficients 
of f(0) in an interval of length 2x, say in the interval (—x, x). On 
evaluating these from the familiar formulae 
+n 


=A f(t) sin nt dt (23) 


+x 
A,= ae JIO cosnidt; B, = : 
and substituting the values obtained in equation (2), we get the 
required solution of Dirichlet’s problem. 

On comparing Fourier series (22) with the solution as expressed 
by (20), we can state the result obtained as follows: the solution of 
Dirichlet’s problem for a circle is obtained by writing down the Fourier 
series for the boundary values f(0) and multiplying the (n + 1)-th term 
by (r/R)". 

The solution can be expressed as a definite integral instead of by 
the infinite series (20). We substitute expressions (23) for the coeffi- 
cients in (20): 


U (r, 8) =f roa +52] 40) cone 0). Bk 


or 
U (r, 9) = 4 frof + 25 (F) osn —6)| dt. 


Formula (14) of [I, 174] gives immediately: 


l— r? 


n Sih 
1+2 5r COS NY = 32 —Greosp F l 


n=l 


O<r< 1). (24) 


On replacing r by r/R and by (t — 0), we finally get the following 
expression for U(r, 0): 


R? — 72 


+n 
l 
U(r, 0) =ar | AO eR P (25) 


It may be pointed out that if we wrote (+k?) instead of (—k?) for 
both sides of equation (18,), we should have (4e? + Be~**) instead 
of (A cos kð + B sin k0) in expression (19), and the exponential 
expression is not periodic for any real k. 
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We have assumed in deducing formula (25) that a solution of the 
Dirichlet problem, i.e. a function U(r, 6), exists. Moreover, we have 
made use of the Fourier expansion (22) of f(@), which is not necessarily 
valid, and we have substituted r= R directly in this expansion. 
All this obliges us to check formula (25), i.e. we have to show that 
the integral on the right-hand side yields a harmonic function inside 
the circle r< R and that /(@) gives the boundary values of this 
function on the circumference. We generally refer to the integral in 
(25) as a Poisson integral. 


196. Poisson integrals. We shall take the radius R of the circle as 
unity in this section so as to simplify the writing; in this case, (25) 
takes the form 

+2 
1 l-r? 
Ur.) =ar | EOF (26) 


-7 


The integral yields a function of r and 90, since the second factor of 
its integrand, 


1-7 
1 —areos(@— 0) Jr? (27) 


contains the parameters r and @ in addition to the variable of integra- 
tion t. This second factor has a period of 27 with respect to 0, and 
the same is therefore true of function (26). It follows from the obvious 
inequality 1 — 2r cos (t — 0) + 7? > 1 — 2r + 7? = (1 — 7)? that (27) 
and its derivatives of any order are continuous functions of r and 6 
for 0 < r < 1. This means that the integral of (26) can be differentiated 
with respect to r and 6 under the integral sign [80], and this can only 
concern the factor (27). But it is easily shown by using the expression 
for Laplace’s operator in polar coordinates [119] that (27) satisfies 
Laplace’s equation. It follows at once from this that expression (26) 
defines a function U(r, 9) which is harmonic for r < 1. An important 
part of the proof still remains: to show that U(r, 0) is equal to /(@) 
on the circumference r = 1. 

We remark first of all that the harmonic function U(r, @) is identi- 
cally equal to unity if we set f(t) = 1 in (26), in other words: 


+7 


_ 1l 1—,r? 28 
1 =a J Ia cos G0) prt TE 2) 


— y 
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The proof is as follows. We have from (24): 


l-r? 


Iros fel +2 gr cosn(t—0) (0<r< l), 
< 


=1 
the series on the right being uniformly convergent in regard to t, 
since its terms do not exceed 27” in absolute value. On integrating 
term by term with respect to t, we obtain (28). 

Function f(é) is defined on the circumference r = 1 and has period 
2n, ie. f(—x) = f(x). We continue it outside the interval (—z, 2) 
in a periodic manner so as to get a continuous function f(t) of period 
2x in the interval — œ < t < +, 

We introduce a new variable of integration » = ¢ — 6 in place of ¢, 
so that t = pọ + 0 and dt = dg. On taking account of the periodicity 
of f(t) and cos (t — 0), we can retain the original interval of integration 
(— x, x) [142] and write: 


1— 7 


+7 
U (r, O= | AOH eae. (29) 


Let the point (r, 0) tend to (1, 9,) on the circumference. We have 
to show that, with this, 


lim U (r, 0) = f (6p). 
We carry out the change of variable in the integral of (28), multiply 


both sides of the equation by f(8,), and subtract the equation obtained 
from equation (29): 


1—r? 


fn 
U(r, 8) — f(80) = 52 | Uf @ +9) —f Od pace tp AP (30) 


We now want to show that the integral on the right tends to zero 
as r—> 1 and 0 —> 6,, i.e. that its absolute value is as small as desired 
for r sufficiently close to unity and 6 to 69. For any given positive e, 
an 7 can be found such that in the interval —7 < 9 < ^, 


IF + 9) — f(O) <= (31) 


for all 6 sufficiently close to 6). We divide the interval of integration 
in (30) into three parts: 


(— x, — n), (— 7,0), (4, 7). (32) 
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We consider the absolute value of the integral over the second 
part: 


+n 
I= 35 | +9 —/)] re. 
=n 


— 2r cosg + r? 


On noticing that the fraction in the integrand is positive, replacing 
the difference term by its absolute value, and using (31), we get: 


€ l-r? 
|< * 2x | reece. 
—n 


or, on widening the interval of integration: 


E l=r? 
lLI<z' 3 +Í Er T ET 


whence, in view of (28): 


L< $. (33) 


We now consider the integral over the first of intervals (32). 
We have cos p < cos 7 in this interval, so that 


1 — 2r cosy + r? > 1 — 2r cosy + r? = (1 — r)? + 27 (1 — cosy) 
or 


1 — 2r cos g + 7? > 4r sin? 4. 


The absolute value of the difference [f(g + 6) — /(@))] does not 
exceed some fixed positive number M, since f(t) is a continuous 
periodic function. We can thus write for the integral over the first 


interval: 


[I< (1-7) (@—0), 


Sar sin? 4L 7 


whilst a similar inequality is obtained for the integral over the third 
of intervals (32). The right-hand side of this inequality tends to 
zero as r—> 1, so that the sum of the integrals over the first and 
third intervals has an absolute value less than ¢/2 for all r sufficiently 
close to unity. If we take into account (33) and the arbitrary smallness 
of £, we can say that the right-hand side of equation (30) in fact tends 
to zero as r—> 1 and 0 —> 5. 
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The connection between the integral expression (26) and the Fourier 
series for {(@) should be noticed. The Fourier series is of the form (22), 
where for the moment we put R = 1: 


A Ba i 
> + = (A,, cos nð + B sin nð), (34) 


the coefficients being defined by (23) with R = 1. If, for instance, 
f(0) satisfies Dirichlet conditions [143], series (34) is convergent for 
all 0. We cannot assert this in the general case of a continuous function. 
We can always say, however, that A, and B, —> 0 as n —> œ, so that 
the series 


fo + Š (An cos nð + B, sin n6) r” (35) 
n=l 


converges for r < 1, and as is clear from [195], the sum of the series 
in fact gives function (26). It also turns out that the sum of series (35) 
tends to (0) as r —> 1, i.e. the sum tends to the function from which 
the Fourier series (34) originated, although this latter series may in 
fact be divergent. 

We apply this idea to any series, say 

S u,,. (36) 
n=l 

If this series is convergent to the sum s, Abel’s theorems regarding 

power series [I, 148] show that the series 


w (r) = > Unt” (37) 
n=0 


is convergent for 0 < r < 1, and in view of its uniform convergence 
in the interval 0 < r < 1 [I, 149], we have: 
lim w (r)=8. (38) 
r+1—O 
It may happen, however, that series (36) is divergent, whilst 
series (37) converges for 0 < r < 1 and w(r) has a limit as r > 1 — 0, 
i.e. (38) is valid. In this case, s is called the generalized sum of divergent 
series (36) in Abel’s sense, and we refer to an Abelian summation 
of (36). It follows at once from what has been said that the generalized 
sum exists for convergent series and is the same as the ordinary sum. 
The results obtained above in regard to Poisson’s integral can be 
stated thus: the Fourier series of a continuous periodic function f(0) 
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has an Abelian summation for any 0 and a generalized sum equal to (6). 
It should be noticed that, in investigating the Poisson integral, the 
point (r, 6) has tended to (1, 65) in any manner, and not necessarily 
along a radius. 

Let us suppose that r > 1 in integral (26). We may verify, exactly 
as above, that (26) yields a harmonic function outside the circle r = 1. 
We investigate its boundary values by rewriting it in the form: 


1- (F) 
U (r, 0) = Ber errs) dt. (26,) 


This new integral is the same as (26), provided we replace the r 
in the latter by l/r, whilst we have l/r < 1 since r > 1. Thus all 
our previous discussion is applicable, on changing r to 1/r, to the 
integral (26,), which therefore tends to f(6,) as the point (r, 6) tends 
o (1, 99) from outside the circle. Hence we can say that the function 


+n 
1 r? — l 
V (7,8) =a {fo 1 — 2r cos (t — 6) + r? dt 
gives the solution of Dirichlet’s problem with boundary values f(0) 
for the part of the plane lying outside the circle r = 1. As is evident 
from the last expression, the function V(r, 0) has a finite limit when 
the point (r, 6) becomes infinitely remote: 


lim V (r, 0) = al f (t) dt. 
r=% 

As we remarked above, the solution U (M) of the Dirichlet problem 
for an infinite part of the plane lying outside a closed contour 1 is 
unique if we assume that the required function tends to a finite limit 
when the point M is at an infinite distance (cf. Volume IV). 


197. Dirichlet’s problem for a sphere. Let R be the radius of a 
sphere (X) and let f( M’) be the given boundary value of a harmonic 
function on the surface of the sphere, M’ being a variable point of 
the surface. We assume that the function f(M’) is continuous on the 
surface of the sphere. 

Let r denote the distance of a variable point M of space from an 
arbitrary fixed point M, inside (£). In addition to M, we take the 
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point M, lying on the produced radius OM, of the sphere and such 
that (Fig. 142): 
OM,-OM, = R?. (39) 
The point M, outside the sphere is sometimes said to be symmetrical 
to M, with respect to (2). Let 7, denote the distance of the variable 
point M from M,. If M lies at M’ on the surface of (2), r and r, are 
connected by a simple relationship which we shall now deduce. The 
triangles OM, M’ and OM, M’ are similar, since they have a common 
angle with vertex at O and the sides 
forming this angle are proportional 
by (89). It follows from their 
similarity that 


|MoM’| _ |OM| p 7 _ |OMo| 
| MM’ | | OM’ | To R 
whence 

l1 e@ 1 

oe p 0) 


where ọ = | OM, | is the length of 
the radius vector from the centre 
of the sphere to M,. Since M, lies 
outside the sphere, l/r, does not 
become infinite inside the sphere and therefore represents a harmonic 
function inside it [119]. Equation (40) gives the boundary values of 
this function on the surface of the sphere. Let U(M) be the solution 
of the Dirichlet problem. Equation (13) gives: 


31 
vata = 2) [29 es v has | (41) 


On the other hand, we find on applying expression (6) to the harmonic 
functions U and V = ijr: 


1 
= {fre 1 wy ad las. (42) 


On multiplying both PA of (42) by the constant R/42o and sub- 
tracting from (41), we can eliminate a) én by using (40): 


var dole “eee 


Fig. 142 
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The given function /(M’) represents the values of U on (X), so 
that we can write the above as 


2) aft 
our =2-{{rar|2 Gd = E as. (43) 
(2) 


This is in fact the solution of the Dirichlet problem for a sphere 
since the integrand is known. We shall put the difference in square 
brackets in another form. We first remark that the surfaces 7 = const. 
are spheres with centre M, so that grad r is a unit vector in the 
direction M, M, and consequently, 


a = grad, 7 = cos (r, n) 


and 


1 
= ‘Bq = pe 008 (r, n). 


Similarly, 
ak 1 ) 
= 7 cos (71, n), 


where the r and r, in the cosines denote the directions M, M and 
M,M. This gives: 


Rn r) 1l R 
oo an pu m yr COS (r, 2) — or? cos (7,, n). (44) 
On introducing 9, = | OM, | = R?/e, we can write from triangles 


OM’M, and OM’'M;: 
e? = R? + r? — 2Rr cos (r, n); of = R? +7} — 2Rr cos (r, n). 


Having thus determined cos (r, n) and cos (r, n), substitution in 
equation (44) gives us, in view of (40) and the definition of 9,: 


1 1 
zl) el _ Be 
e On on — Rre ?’ 


so that we can now write (43) as: 


U (Mo) = aay | ff) ER as, (45) 
(=) 
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Alternatively, if we bring in the angle y formed by the radius vector 
OM, with the variable radius vector OM’, the angular spherical 
coordinates (6’, g’) of the point Jf’, and the spherical coordinates 
(0% Oo Po) Of M, with the origin at O, we can write: 


, 4) Rt — 9? > , , , 
U (0, 0o Po) = ix affres TETO sin 6’ dé’ dg’. (46) 


This integral form of U(M,) is analogous to the Poisson integral 
obtained in the case of a plane. To prove that the integral appearing 
in (45) yields a harmonic function, it is sufficient to show that the 
fraction (R? — 9*)/r3 is a harmonic function of M, for fixed M”. 
We introduce spherical coordinates with the origin at M’ and the z 
axis directed from M” to O and with the usual notation of 0 = 4 OM’ Mọ. 


We now have: 
Ri—e? = 2Rcos8 


r3 r? 


a 
: 


We find on substituting the right-hand side above in the spherical 
coordinate form of Laplace’s equation that we in fact have a harmonic 
function of the point M,. We now show that, for any position of M, 
inside the sphere: 


rR? — 92 = 
oan r3 dS =1. (*) 
z 
We take spherical coordinates with the origin at O and the z axis 
directed from O to M» with in this case 0 = L M, OM’ and r? = 
= R? — 2R ọ cos 0 + ọ?. The integral in equation (*) becomes 


2a n 
Ea Rt sin 8 d dy 2: 
4nR (R3 — 2Rọ cos 6 + 92)*/2 
a 
__ (R®?—9)R sin 6 dé = 
_ 2 | (R? — 2R cos 0 + g3)” 
0 
R2-—- o? 6-0 


(R? — 2ọR cos 6 + ?)—” 


2g =x 


whence, if we take into account the fact that 9 < R, we get expres- 


sion (*); 
ea a ee) 
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The further proof of the fact that integral (45) has boundary 
values /(J£) on the sphere proceeds as in the case of the Poisson 
integral. 

The solution of the exterior Dirichlet problem with boundary 
values {(M’) is given by: 


U (Mo) = ax | [10 5" as (45,) 
p 


or 


2n n 
R Pa 2— R? ; , r 7 
U (@; 99; Po) =a fre 9 enna ER in 8 dd’ dy’, (461) 


where 9 =| OMe l, r =| M, M’ | and y=ZM,OM’, whilst here 
e > R. It may be shown as above that the integral of (45,) yields a 
harmonic function outside the sphere. To verify that U(M,) has the 
boundary values {( M), we rewrite (46,) as: 


U (0, Oo» Po) = 
-gf per 7 wey in 6’ d6’ dg’, (46 
= = g’) (Ra 29°R’ cos y + 07)" sin p’, (46,) 


where 9’ = 9} and R’ = R~. With this, we have o’< R’, and 
when the point (0, 99, po) tends to M(R, 4, œ), lying on the sphere 
2, (0’, 95, Po) tends to (R’, 0, p). We have, from the result obtained 
for the interior of the sphere: 


2n n 


R’ , , R? — 9” , ; ' 
a] ff (9 +P ) (R? þau 20’ R’ cos y + 07)8/2 sin 0 ad dy —>f(M), 


and we can say, since 9’-> R’, that the right-hand side of (46,) in 
fact tends to f{(M), as we wished to prove. It should also be noticed 
that, by (46,), U(@, 99, Po) tends to zero when M, is at an infinite 
distance, i.e. when 9 —> œ. This follows from the fact that the numera- 
tor in the integrand of (46,) contains 9”, whilst the denominator is 
clearly of order 9°. 


198. Green's function. The solution of the general case of the interior Dirichlet 
problem for any surface (S) may be deduced from the above solution for the 
spherical case. Expression (13) does not give the solution directly since 0U/dn 
appears under the sign of the double integral, in addition to U itself, the 
values of which are known on the surface. To solve the problem, we have to 
eliminate the derivative. Let M, be a fixed point inside (S). Let G,(M; Mo) 
be a known function with the following two properties: (1) considered as a 
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function of the variable point M, it is harmonic inside (S); (2) its boundary 
value on (S) is 1/r, where r is the distance from the variable point of (S) to M,. 
Let U(M) be the required solution of the Dirichlet problem. We can write, on 
applying (6) to the harmonic functions U(M) and G,(M; M,): 


j= ff[v ae M,) ~@,(M; M) 2L 0 SUM) as , 
(5) 
or in view of the boundary value of G,(M; M,): 


= eee 20. (16s Ma) _ 1 BU UM) ) ag, 


On multiplying this equation by 2/4 and adding to (13), we get: 
U(M)=—qz\foaog[--aan mas. (47) 
. 4n On Lr Est he a= 
(S) 


.This gives the solution of the Dirichlet problem, if G,(M;M,) is known. 
The difference in square brackets: 


G(M; M.)=——G,(M; Mo), (48) 


is known as the Green function for the domain bounded by (S) with pole at My. 
Two basic properties of Green functions follow from the definition of G,(M; M,,): 

1. G(M; M ,) is harmonic inside (S) except at the point M, where it becomes. 
infinite; whereas the difference G(M; M,) — l/r remains finite and is harmonic 
everywhere inside (S). 

2. G(M;M,) has zero boundary values on (S). 

If we place a unit positive electric charge at M , and look on (S) as a conduct- 
ing surface joined to earth, Green’s function G(M; M,) gives the electrostatic 
potential of the field obtained inside (S). 

In the case of a sphere, G,(; M,) isequal to (R/e) ° l/r, from (40), and Green’s 
function becomes: 


G(M; M) = 1 — 1. (49) 


We have obtained (47) by using (13) and applying Green’s integral formula 
to U(M) and G,(M; M,). This procedure requires special proof, involving an 
investigation of the behaviour of the derivatives on approaching the surface 
(S). A rigorous proof of (47) with wide assumptions regarding the surface (S) 
and the function U(M) on (S) was first given by A. M. Lyapunov. 

We have a precisely analogous formula for the solution of the interior Dirichlet 
problem in the plane case: 


aa we 8G (Ms Mo) g 


U (M) =— z| U (M (48,) 


where the Green function G(M; M,) for the domain with contour / and pole at. 
M, must have the following two properties: 
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1. GM; M,) is harmonic inside (l) except at M,, where it becomes infinite; 
whereas the difference G(M; M,) — log l/r is harmonic at all interior points 
including M, 

2. G(M; M,) is zero on the contour (J). 

It may readily be seen that only one function can exist with these two pro- 
perties. If there were two, say GOM; M,) and G)(M; M,), their difference 
would be harmonic everywhere inside S or | and would be zero on. these bound- 
aries, i.e. it would be identically zero inside S or l. 


199. The case of a half-space. As an example of the application of (47), 
we consider the Dirichlet problem for a half-space. We require to find the func- 
tion U(x, y, z) which is harmonic in the half-space z > 0 and has known 
boundary values f(x, y) on the plane z = 0: 


U lemo = Í (2, y). (50) 


Let r be the distance of a variable point M from the point M (£o Yo Zo)» 
where z, > 0, and r, be the distance of M from M (£o Yo —Z), the symmetrical 
point to M, with respect to the plane z = 0. The fraction 1/r, is a harmonic 
function of M in the half-space z > 0, since M, lies outside this region. If M 
lies on the plane z = 0, obviously, 1/r, = l/r. Green’s function becomes here: 


1 1 
V (a — z0)? + (Y ~ Yo)? + C —20)* V (aw — 20)? + (Y — Yo)? + (2 +20)? 


From the point of view of the half-space, the outward normal to the plane 


z =Q is in the opposite direction to the z axis, i.e. 0/8n = —0/0z, and (47) gives: 
-+æ +o 
1 ə 1 
U eow = f S renl ~ 
OIE A Gel ¥(@— z)? F (y— v)? + @—*0} 


_ l | dz dy. 
Vie — 20)? + (y — Yo)? + (z + 2%)? Jz=0 


We have to put z = 0 after differentiating the square bracket. Some simple 
working eventually gives us: 


+o +œ 


z f(z, y) 
U (2n 2o 2o) = se f | tecr ra rev. (61) 


We shall not verify that the right-hand side represents a harmonic function 
and has limiting values f(x,y) when (a, Yo, Zo) tends to (x, y, 0). Infinitely 
remote points lie on the surface of the domain in the present case, and it may 
readily be seen that the solution has the following property: if f(x, y) is continu- 
ous at infinity, ie. has a definite finite limit a as the point (x,y) moves to 
infinity on the plane z = 0, U(2,, Yo Zo) has the same limit a a8 (£o, Yos Zo) 
moves to an infinite distance in any manner in the half-plane z > 0. 
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In other words, our solution has the required boundary values even for infini- 
tely remote points of the plane provided f(z, y) is continuous at these points. 

In the same way, solution of the Dirichlet problem for the half-plane y > 0 
leads to a Green function of the form: 


log dies log ais log l — log 1 
r T, V(@ — 2o)? + (Y — Yo)? Væ = zo)? + (y Fy) 


and with boundary values 


U ly=o = Í (2), (52) 


(48,) gives the solution as 


-+ œ 
U (zo yo) = 2 f EO de. (53) 


The detailed treatment of Neumann’s problem is left over to Volume IV. 


200. Potential of a distributed mass. We take the non-homogeneous 


Laplace equation 
ZU OU ZU 
Ja F Oy? Eg oe TP (x, Y, 2) (54) 


in a finite domain D with a surface S. The general solution consists 
of the sum of some particular solution and a function harmonic in D. 
Let a solution exist to which (9) is applicable. Since the derivative 
of 1/r with respect to any fixed direction satisfies Laplace’s equation, 
the integrand in the surface integral of (9) and the integral itself 
are harmonic in D. Thus the triple integral in (9) must satisfy equation 
(54). But by (54), the AU in the triple integral can be replaced by 
p(x, Y, z), so that we have as a particular solution of (54): 


U(x, y, 2) = -S {fee dv (55) 
D 


r= t-t- y+ (C— 2? 


We have obtained this result on the assumption that (54) has a 
solution to which (9) is applicable. A full solution of the problem 
requires a more detailed consideration of the volume potential (55) 
with definite assumptions regarding g(N). We write yu(N) = 
= —9(N)/4x and consider the following distributed mass potential: 


van ={ { [2a (56) 
.D 
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or 


V (x, y, 2) = f f [Ene dv. (56,) 
D 


Let u(N) be continuous in D as far as S. As already remarked, 
integral (56) is a particular solution if M lies outside D. In this case, 
V(M) has partial derivatives of any order. These derivatives can be 
obtained by differentiating under the integral sign, and V (M) satisfies 
Laplace’s equation AV =0.If M belongs to D, (56) exists as an impro- 
per integral and the integral obtained by differentiating the integrand 
e.g. with respect to x likewise exists. We have not shown, however, 
that this latter integral yields the partial derivative of V with respect 
to x. We shall prove two theorems regarding integral (56): 

THEOREM 1. If u(N) is continuous in D as far as S, V(M) and its 
first order partial derivatives are continuous throughout space and the 
derivatives can be obtained by differentiation under the integral sign. 

We shall carry out the proof for any position of M in regard to D. 
We replace 1/r by a new function which only differs from 1/r for r < e, 
where e is a given positive number, and which is continuous and has 
continuous derivatives with respect to the coordinates as far as r = 0. 
We do this by taking, for r < e, the polynomial: a + fr? =a + 
+ B[(E— x)? + (n — y} + (É — z}?], a and £ being chosen so that, 


for r= e: 
a + fet = + and 2pe= — : : 


E 
in this way, we obtain continuity of the derivatives at the meeting- 
point of functions 1/r and a + ĝr?, ie. at r= e. The equations 
written give: a = 3/2e, 8 = —1/2e3, and we arrive at the function 
g(7), defined by the equations: 


galr) =} for r>e 
(57) 


3 1 
Jer) = 35> — aa for r<e. 


On replacing the 1/r in integral (56) by this function, we get instead 
of V(M): 


V. (M) = | f fu) alr) de, (58) 
D 


which is continuous throughout space and has continuous partial 
derivatives which can be found by differentiating under the integral 
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sign, since the integrand of (58) is itself continuous and has con- 
tinuous derivatives for r > 0. We can write, for instance: 


BY (af) Ji fu (N) $g. (7) de. (59) 


We consider the difference: 
V (mM) — V.M) = f Í fe an[+—9 o] (60) 


Since 1/r and g,(r) are identical for r > e, their difference appearing 
on the right-hand side is zero for all points N lying outside the sphere 
o, with centre M and radius e. If, for instance, M lies outside D 
and c is less than the distance of M from D, the integral on the right- 
hand side of (60) is zero. 

In other cases, the sphere o, can belong partly or wholly in D. 
If we write m for the greatest absolute value of u(N) in D and 
remember that g,(r) is a positive function, we can write for the 
integrand in (60): 


[am-an] < mft +a], (61) 


whilst the integrand vanishes outside o,, as already remarked. We can 
clearly write, on integrating the positive function on the right-hand 
side of (61) over the whole of o,: 


e 2n n 


|V (M) — V. (M)| < me +9, o] 72 sin 0 dO dy dr. 


We obtain on replacing g,(r) by the second of expressions (57) and 
carrying out the quadratures: 


|V (M) — V,(M)| < Z me. 

It is clear from this that the continuous functions V,(M) tend 
uniformly as regards the position of the point M to V(M) as e— 0, 
so that V(M) is likewise a continuous function [I, 144]. We investigate 
the partial derivatives of V(M) by differentiating the integral of (56) 
with respect to x under the integral sign and writing W(M) for the 
new integral obtained: 


wimy=fffeme (=) a. (62) 
D 
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As above, we now consider the difference: 
8V,(M) _ ə fı ə 
W (m) — 220 =] ieia (=) — ae 9”) ] do 


On noticing that we es for any function A(r): 


dh Ae) a—é 
Ehi) = - 
and that |(~—é/r)| < 1, we can write as regards the last integrand: 
| ð (1 fo] l dg, (r) 
amje kao] < mfi], 


or, precisely as above: 


ran cof T+ 
000 


Since, by (57), 


dg, (r) 
dr 


Pr sin 6 dé dọ dr. 


dg, (r) 


ž 
7 | =F for r<e, 


we obtain after carrying out the quadratures: 
j ƏV. (M) 
Ox 


| W (M |< Same, 

whence it follows that as e —> 0, 0V,(M)/0 x > W(/) uniformly with 
respect to M. We proved above that V,(M) tends uniformly to V(M). 
Remembering the theorem of [I, 144], we see that W(M) is the 
partial derivative of V(M) with respect to x, ie. using (62): 


æ] J fenyw ff fama (ee 
D D 


The continuity of W(M) follows from the continuity of partial 
derivatives (59) and their uniform convergence to W(M), so that the 
theorem is fully proved. The derivatives with respect to y and z may 
þe investigated in precisely the same way. It may be noticed that 
our proof has made use only of the boundedness and integrability 
of u(N). 


201. Poisson’s equation. We must amplify our assumptions 
regarding u(N) before finding the second order derivatives of V(M). 
THEOREM 2. If the continuous function u(N) has continuous first 
order derivatives inside D, V(M) has continuous second order derivatives 
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inside D and satisfies the equation inside D: 
AV (M) = — 47u (M). (63) 


We take any fixed point M (£o Yo, Zo) inside D. Let o, be a sphere 
with centre M, and radius « lying inside D, and let D, be the part 
of D lying outside o,. We split potential (56) into two terms: 


vM) = f f futa 
D, 


f f u (N) + de = V, (M) + Vo (M), (64) 


Og 


and, in view of Theorem 1: 


“we ol I J Ma (Fae + 
ai fe (W) (yd = SE g D, (65) 
We have: 
e(s)=— (Fs (== 2 FF C- 2 ) 
and we can therefore write: 


none @)=— franz] +82 


r 


On substituting this expression for the integrand of the integral 
over a, in (65) and applying Ostrogradskii’s formula we get: 


wae =| | Je ze Ge} et 


Fh: au.) OL, arie (ANa aa a8, (66) 


where S, is the surface of sphere o, and n is the direction of the 
outward normal to S, at the point V. The first term on the right-hand 
side of (66) is a proper integra] for M lying inside c, and has derivatives 
of all orders inside o,. The same can be said of the third term, consisting 
of an integral over the surface of o,. The second term is the volume 
integral over ø, with continuous density 0 y(N)/aé, and by Theorem 1, 
has continuous first order derivatives throughout space. Hence we 
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can say that 0V(M)/d2 has continuous first order derivatives inside o,. 
In view of the arbitrariness of choice of M, in D, we can say that 
ƏV (M)/Əx has continuous first order derivatives everywhere inside D. 
On applying the same arguments to ƏV(M)/ðy and ƏV(M)/əz, 
it can be asserted that V (M) has continuous second order derivatives 
inside D. It now remains for us to prove (63) for any point M, inside D. 

We turn to equations (64) and (66). As we know, the potential 
V (M) of the distributed mass over domain D, is a harmonic function 
inside o,, since c, lies outside D,, ie. AV,(M) = 0 inside o,, so that 
AV(M) = AV ,(M) inside o, It follows that AV(M) is simply 
obtained by first differentiating the terms of (66) with integration 
over c, and S, with respect to x under the integral sign (using Theorem 
1), then obtaining analogous expressions for the second order derivat- 
ives with respect to y and z, and finally adding all three derivatives. 
It must be borne in mind here that only the factor 1/r in the integrands 
depends on (xz, y, z). Having thus found AV(M) inside o,, we take 
its value at the centre M, of sphere o,. If we write AV(M,) for this 
value, and r, for the distance from M, to the variable point of integra- 
tion, we get: 


av (ato) = ff f[Pae ee 


apee- oon 


> cos (n, z)| ds. (67) 


“cos (n, x) + 


aa 


This expression is valid for any choice of £, provided the sphere o, 
lies inside D, and the value of AV(M,) is clearly independent of the 
choice of e. We let e tend to zero, and show that the triple integral 
likewise tends to zero. It is sufficient to take the integral of one of 
the terms. Let m be the greatest absolute value of the continuous 
function 0n(N)/dé in a sufficiently small fixed sphere o,,. Recalling 
that | (E — 2)/r) | < 1, we have for e < é): 


[fg ao] om fF 
% 


On introducing spherical coordinates with origin at Jf, and writing 
dv = ri sin 0 d6 dp dr,, it may be seen that the right-hand side is 
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equal to m - 47e, whence it follows that the triple integral tends to 
zero as e— 0. 

We now consider the surface integral of (67). Since the outward 
normal n is directed along the radius of the sphere, we have: 
tie 


3 
To 


&— xo 


3 
To 


Y 


cos (n. x) + 7# cos (n, y) + Zo cos (n, 2) = 


3 
To 


= na [cos? (n, x) + cos? (n, y) + cos? (n, z)} 5 F 


so that the surface integral can be written as 


= fleas, 
Ss 


or, on using the mean value theorem, as 


[founds = 4m), 
Se 


where N, is a point on S,. As e— 0, N, tends to M, and u(N,) > 
— u(M,), and the surface integral of (67) gives in the limit 47 u(M ,), 
which brings us to (63). We generally refer to (63) as Poisson’s formula 
or Poisson’s equation. 

It follows at once from the theorem proved that, if p(x, y, z) is 
continuous in a domain D as far as its surface S and has continuous 
first order partial derivatives inside D, (55) gives the solution of 
equation (54). We remark that, if p(N) is defined throughout space 
and diminishes rapidly enough as NV moves away to infinity, we can 
take D as the whole of space. 

Similar theorems may be proved for the integral over a plane 
domain: 


V (M) = f fu (N) log do 
B 


or 


V (æ, y) = | f a nlog—do; (r= VE = 2 F =y). 
B 


If u(N)is continuous in B as far as its contour, V (M) is itself continuous 
and has continuous first order partial derivatives throughout the 
plane, these derivatives being obtainable by differentiation under the 
integral sign. If, moreover, u{N) has continuous first order partial 
derivatives inside B, V(M) has continuous second order partial 
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derivatives inside B and satisfies Poisson’s equation at every interior 
point of B: 
AV (M) = — 22u (N) 


In addition to integral (55), we consider 


vM)=— =| [feaaar; N) av, (55) 
p 


where G(M; N) is Green’s function in domain D with pole N. The integration 
in (55) is carried out with respect to the point N. We can write, on taking into 
account expression (48): 


mae | [JEP a+ ae | na: N) dv, 


where G,(M; N) is a harmonic function of M everywhere inside D and has 
the boundary value 1/g on S, ọ being the distance from a variable point on S 
to the point N. Since M appears as a parameter in the integrand of the second 
integral whilst G (M; N) is harmonic everywhere inside D, it follows that the 
second integral is likewise a harmonic function of M inside D. Laplace’s operator 
yields p(M) from the first term on the right-hand side by what has been 
proved, so that the U,(J) defined by (55,) satisfies equation (54). Further- 
more, in view of the fact that G(M; N) has zero boundary values on S, (55,) 
shows us that U,(M) satisfies the boundary condition on S: 


U, (M)|s =0. 


Equation (55,) defines the solution of equation (54) satisfying the boundary 
condition written. The boundary values of solution (55), obtained as values of 
the integral on the right when the point (x, y, z) lies on S, depend on ¢(z, y, z). 
It should be remarked that the above investigation of function (55,) is not 
strictly rigorous; we require a further investigation of the dependence of 
G(M; N) on the point N, a proof that we can differentiate under the integral 
sign and can pass to the limit under the integral when M tends to a point on 
the surface S (cf. Vol. IV). 


202. Kirchhoff’s formula. Equation (13) gives the value at every interior 
point of a function harmonic inside a surface S as an integral over S. An 
analogous expression can be got for the function V(x, y, z, t) = V(M; t), satisfy- 
ing the wave equation 

ov 
3t? 


=a AV. (68) 


Let V(M; t) be continuous together with its derivatives to the second order 
in a domain D bounded by a surface S, with allt > 0. Let M, be a fixed point 
in D and let r = M,M denote the distance from M, to a variable point M. 
We apply general formula (9) to the function 


U (x,y, z,t)=V (=. Ys Z, t— z) , (69) 
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or more briefly, 
U(M,t)=V (m, cs =). (70) 


If w(t) is a function of t, we write [w] for the function obtained by replacing 
the ¢ in w(t) by t — rja, i.e. [w] = w (t — rja). 

We usually call [w] the retarded value of the function w(t). The meaning of 
the term becomes clear if we take a as the velocity of propagation of some 
process. 

We can write (69) or (70) in this notation as: U = [V]. In differentiating 
(69) with respect to the coordinates it must be recalled that [V] depends on 
the coordinates both directly and indirectly via the r, appearing in the fourth 
argument. We thus get 


0U OV 13y] or 
on lon lta leone (71) 
Similarly, on using the expression for Laplace’s operator in polar coordinates 


with centre M, [119]: 


eU 2 0U 1 0. ,90U 1 &U 
ne or? + r or i r? sin 0 xl oa) résin?6 dg? 


-+ 


and bearing in mind that 
oU E ] >. FU TeV) PU [Sa] 
30 38 [ | Oy? 


> 002 — |3|’ Og? 
3U 25 [=] l É . &U 2 eV 2 r&y l | 
or or a |} Ərz =|$r|-slael< oe? |. 


we get: 
27 ev], 1V] 2 [av 
sv = 871-2 | Sar | tar Ler |- a Lr] 


But we have, by (68), [AV] =, l/a? [@?V/d2] and therefore: 


2 41 fev eV 1fovVv 
ww == llela 


It may readily be shown that 


AU 2 fl E 1f ov ] [+ (72 
r -2 al-F ler] -F | ) 
is the divergence of a vector: 
AU . §2 7 Ov 
= rae = div {= =| grad (log n} ° (73) 


We have in fact by the formula of [112]: 
div (JA) = f div A + grad f'A. 


In the present case, f = 2 [9V /ðt]ja, and A = grad (log r) is a vector of length 
l/r directed along the radius vector from M,. The scalar product grad f. A is 
the product of | A | and the projection of grad f on the direction of A, this latter 
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term being the derivative of f with respect to the direction of A. Thus we have 
in the present case: 


ov ov 2 @f7oV 
aiv {> [a at p [era (log n}= ala] 4% er + or Or cs at $ 
If we use (72) and differentiate [9V /0t] in accordance with the rule for func- 


tions of a function, (73) is in fact shown to be valid. If we now apply Ostrograd- 
skii’s formula and take into account the fact that grad, (logr) = (1/r xðr/ðn), 


en fff a FE] Sas 


On substituting this expression and expression (71) in the right-hand side 
of (9) and noting that U(M,, t) = V(M œ» t), since r = 0 at the point M,, we 
obtain Kirchhoff’s formula: 

al 


vonoi EER maes a 
(S) 


This expresses V(M ,, t) in terms of the retarded values of V, 9 V/3t and Ə V/ðn 
on the surface (8). As in the case of (9) for harmonic functions, the presence 
here of 0V/8n prevents us from using the formula for solving wave equation 
problems directly. The Kirchhoff formula is closely related to Huygens principle. 

Suppose that (S) is a sphere of radius r with centre M,. Here, 0/8n = 0/dr, 
and (74) becomes: 


1 OV r [ov 
v (Mo = srl [Lee] taba es, 
(S) 
or, on setting dS = r? sin 0 d8 dp = r? dw: 


V (Ms; o= aa) [i etaa S Par] (75) 


If we take the radius r = at, we have t—r/a = 0, i.e. the retarded value 
reduces to the value of the function at ¢ = 0, and (75) gives Poisson’s formula 
(76) of [171], i.e. the solution of the problem of an oscillation propagated in 
unbounded space with given initial conditions: 


V (Mot) =z an IES r)a +5 atl. [me Joao}, (76) 


the zero subscript wee that 0V/dt and V have to be taken at t = 0, 
whilst the integration is over the sphere of radius at with centre M,. There is 
a close connection between the form of Kirchhoff’s formula and the concept 
of retarded potential. We saw above that, for any choice of w(t), having continuous 
derivatives to the second order, the function 


so(r—Z)-2t (77) 
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is a solution of equation (68). Here, r is the distance from any fixed point of 
space to a variable point [175]. 

Precisely as above, we can construct a Kirchhoff formula for any solution 
of the non-homogeneous wave equation 


eV 
Ot? 


=a? AV + f (x, y, 2, t) (78) 


in a domain D, though the solution obtained contains a triple as well as surface 
integral: 


at 
V (Ma 2) =m bee) eleda od 8+ 
rpe 


On applying this formula to a sphere of radius at with centre M , for the solu- 
tion satisfying zero initial conditions at ¢ = 0, we arrive at (91) of [174]. 


§ 21. The equation of thermal conduction 


203. Fundamental equations, We have seen that the equation of 
thermal conduction in a homogeneous sphere has the form: 


eo (PU | PU , PU 
a Tl (a ta t ar)’ 


(1) 


k is the coefficient of internal conduction, y is the specific heat of 
the body and ọ is the density. In addition to equation (1), we have 
to bear in mind the initial conditions, giving the initial distribution 
of temperature at t = 0: 


u lio = f (x, y, 2). (3) 
If the body has a boundary surface (S), we also have boundary con- 
ditions on (8), which may vary according to physical factors. For 
instance, (S) may be held at a definite temperature, which may be 
variable with time. In this case, the boundary conditions imply 
specifying the function U on (8), this U being possibly dependent on 
time. If the temperature of the surface is not fixed and there is 
radiation to the surrounding medium of given temperature U,, the 
flow of heat across (S) is proportional by Newton’s law (admittedly 


where 
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far from accurate) to the temperature difference between the medium 
and the surface (8) of the body. This gives a boundary condition of 
the form: 

Sul LAU —U,) =0 (on 8), (4) 
where the coefficient of proportionality h is the coefficient of outward 
thermal conduction. 

For the heat distribution in a body of linear dimensions, i.e. in a 
homogeneous rod, which we assume to be located along the x axis, 
we have instead of equation (1): 


ee i (5) 


This form of equation does not take into account, of course, the 
heat transfer between the surface of the rod and the surrounding 
space. 

Equation (5) may also be obtained from equation (1) by taking U to 
be independent of y and z. The initial condition becomes for a rod: 


U limo = f (2). (6) 


If the rod is finite, we have boundary conditions for both ends. 
An end can be subject to a given temperature, as above. With 
radiation, boundary condition (4) takes the form: 


aU Eh(U—U)=0 (at the end), (7) 


the (—) sign being used for the left-hand end, with minimum abscissa z, 
and the (+) sign for the right-hand end, whilst h is a positive constant. 


204, Infinite rod. We start with an infinite rod, for which we only 
have to satisfy initial condition (6) in addition to equation (5). 
We follow Fourier’s method and first seek a particular solution of (5) 
in the form: 

T (t) X (2), 
which gives us 
T’(t) X (x) = a® T (t) X” (x). 
or 
Tt) X"(@) _ gs 


aT  X(x) 


where A? is a constant. We thus obtain: 


T(t) + aT (t)=0; X"(a) + a2 X(x) =0, (8) 
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whence, neglecting the constant factor in the expression for T(t): 
T (t) =e—¥, X (x) = A cos âx + B sin dz; 


the constants A and B can depend on A. 

Since we have no boundary condition here, the parameter À is 
completely arbitrary, and all values of 4 are of the same importance 
when we form the function u(x, t) as the sum 


e777 A (A) cos Ax + B (A) sin Az] 
(4) 


It naturally follows that we replace the summation over separate 
values of A by the integral with respect to 2 from (—°°) to (+°), 
i.e. we put 


© 


u x, t) = f e~¥a*t| A (2) cos Ax + B (A) sin Ax)] da. (9) 


We can easily verify that the function written is in fact a solution 
of equation (5) by using the formula for differentiation under the 
sign of the definite integral. We now turn to initial condition (6), 


which gives us 


ul-o=f(2) = | [4 (A)cos dx + B (A) sin Az] da. (10) 


On comparing the integral on the right with Fourier’s formula for 
the function f(x): 


f(a) = se | da f 7 (8) 008a(g — 2) dé = 


=> | [cos dx i f (E) cos AE aE + sin Ax Í f(é) sin ag dé] da, 


we see that condition (10) can be satisfied by putting 


A(a)= z] f f(E) cosag dé, B (4) = 4f ) sin AE dé. 


=o 
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Substitution of the expressions obtained for A(A) and B(A) in (9) 


gives us: 


u (a, t) =| f(E) dé f e—*2% [cos AE cos Ax + sin Aé sin Ax] dA = 
=a | AAE f eM 0084 (€ — 2) da = 


== | fede f e=% oos (E — 2) da, (11) 
-œ 0 


where we have used the fact that the integrand is an even function of A. 
Though the solution of our problem is given by equation (11), 
this may be simplified on recalling [81] that 


Ee omy 
f e—*4* cos BAdA = Yee tat, 


6 
whence 
= (§—x)® 
1 1 — “Tat 
zleu cos À (£ — x) då = mye at 
0 
so that (11) may be written: 
e 1 -6 
um d= AO gye dé. (12) 


We naturally assume that ¢ is positive in the above and later work- 
ing. The form of the solution has an important physical significance. 
We first of all remark that 


l S (=x)? 
sat ( 1 3) 


2a ¥ mt 


considered as a function of (xv, t), is likewise a solution of equation (5), 
as is evident from the method by which it is obtained, and as may 
be verified by direct differentiation. Now what is the physical meaning 
of this solution? 

We distinguish a small element of rod (x) — ô, 2) + ô) about the 
point 2, and let f(x) be zero outside, and a constant U, inside the 
interval (x, — ô, 2+ ô). This is equivalent physically to com- 
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municating a quantity of heat Q = 26ceU, to the element at the 
initial instant, thus causing a temperature rise of U, in the element. 
The temperature distribution in the rod at subsequent instants is 
given by equation (12), which becomes in our present case: 


Xotd Bs Xetd (=x)? 
(¢—x)® È 
Í Uo 1 e 4a?t dé = ee a e tart dé. 
2a Yat 2cga Yat 26 a 
es 


Xð 


If we now let ô tend to 0, so that the amount of heat Q is distributed 
over a constantly diminishing section of rod and is communicated 
in the limit to the point x, we arrive at the case of an instantaneous 
source of heat at the point x = x, of intensity Q. The presence of such 
a source in the rod leads to a temperature distribution given by: 


Xetd _ (ġ—x)3 


Q 


lim ———— — tat dé. 
+0 2cea ynat 26 
Xa—ô 
Since by the mean value theorem: 
Xt  (g—x)? _ Go—x? 
zy f e t qdé=e ‘ | where zo— ô< Eo < to +ô, 
X,—ô 


and ġọ—> 2, as ô— 0, the above expression is equal to 


Q 1 _. (Xox)? 
— =e 4a3t 
ce 2a Yrd 


It follows that function (13) gives the temperature distribution result- 
ing from an instantaneous heat source of intensity Q = co located at 
the point x = E of the rod at the initial instant t = 0 (replacing 2, by &). 
We can now see the meaning of solution (12). To give the section ¢ 
of the rod a temperature f(¢) at the initial instant, we have to distribute 
in the small element dé about this point the amount of heat: 


dQ = ce f (£) dé, 


or what amounts to the same thing, we have to locate at ¢ an instan- 
taneous heat source of intensity dQ; the temperature distribution 
resulting here is, by (13): 

1 _ -0 


f(s) db Re gat 
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The total effect of the initial temperature /(¢) at all points of the 
rod is the sum of these separate effects, which gives us solution (12) 


obtained above: 


uq, t= | 78) so Ga dé. 


Let the temperature f(x) at the initial instant t= 0 be zero 
everywhere outside an interval (a,, a), where it is positive. Solution 
(12) now becomes: 


u(x, nfo aye Car az, (14) 


If we take an arbitrarily ieee x and arbitrarily small ¢, i.e. consider 
a point as remote, and an instant as near the initial instant, as we 
wish, (14) gives us a positive value for u(x, t), since the integrand is 
positive. Solution (12) thus implies the fact that the heat is propagated 
instantaneously, and not with finite velocity. This is where the thermal 
conduction equation differs essentially from the wave equation which 
we obtained when considering the vibrations of rods. 

In the case of heat propagation in a non-bounded three-dimensional 
medium, we have differential equation (1) and initial condition (3), 
and instead of (12), we get the solution: 


u(x, Y, z, t) = 
ier aut (g—x)8+(n—y)#+ (22) 


= | J fre N, rd 4att dé dy dé. (15) 


— 00 = 00 = 00 


We shall verify that the function given by (12) satisfies equation (5) 
and initial condition (6). The first statement follows immediately 
from the fact that the integrand of (12) satisfies equation (5) and from 
the differentiability of the integral of (12) with respect to t and x 
under the integral sign, if, for instance, f(x) is continuous and absolutely 
integrable in the interval (— °°, ++ °°). As regards the initial condition, 
we bring in a new variable a instead of € in accordance with 


Eé—@2 
2a yt ` 


a = 


Equation (12) now becomes: 


+% 
u(x, t) = = i f(x + a2a Ytje” da. (16) 
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We recall from [78] that 
i 
l= 7 Í e~@ da. (17) 


On multiplying each side by f(x) and subtracting from (16), we 
have 


+o 
u (x, t) — (2) = 7 f U (+ aza Vi) — F (a)] e= da. 
whence 


|e (e 9 =A <7 | IF + a2a Ye) — f e) | 0 da, (18) 


We shall assume that f(x) is bounded as well as continuous and 
absolutely integrable, i.e. | f(z) | < c, so that we have for any zv, t, 
and a: | f(z + a 2a Yt) — f(x) | < 2c. Let £ be a given positive number. 
We can fix a large positive NV, such that 


-N % 
w ery en eee 
nie da< 3 a da < 7: 


With this, it follows from (18) that: 


+N 
|u (x, t) — f (æ) | siia | [fw + a2a Vi) — f(z) |e-* da. 
-N 


Since f(x) is continuous, we can say that, for all ¢ sufficiently near 
zero and for |a| < N: 


1 
|f (x + a2a Vt) — f (2)| < ze 
so that the previous inequality can be written: 
juz N= fe < eti. feda, 
or all the more: 


lu, Afa] f oe do, 
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i.e. we have by (17): | u(x, t) — f(x) | < e for all ¢ sufficiently near 
zero. It follows from this, in view of the arbitrariness of e, that 


lim u (x, t) = f(x), 
t++0 


which amounts to initial condition (6). It may be remarked that ¢ 
tends to zero through positive values. If m and M are the bounds 
of f(z), ie. m < f(x) < M, it follows from (16) that 
+o ars 

m —at mM —a? 

Va Je da < u (x, t) < E le da, 
Thus we have, by (17), m < u(x, t) < M, i.e. the temperature u(x, t) 
lies between the same bounds as does the initial temperature for all 
positive ¢. Similar methods may be used to verify expression (15). 


205. Semi-infinite rods. Let the rod be bounded at one end, say æ = 0, 
so that x > 0. We shall assume that there is radiation from the end into the 
surrounding medium at temperature 0°. 

We now have, in addition to initial condition (6), the boundary condition 


A (19) 


whereas solution (12) is at once seen to be unsuitable since, in view of the 
lnitial condition, f(x) is only defined in (0, œ). It follows that f(x) must be con- 
tinued into the interval (— œ, 0) before we can use (12). 

To this end, we rewrite (12) in the form: 


1 -F -p 
a | [ee +f(—he« Jas, (20) 


+20 0 -oo 
which is readily justified on splitting f into f and f , then replacing ¢ in 
-00 — %0 0 


the former by (— &). In order to substitute in (19), we evaluate 


(x- §)3 (x+6)2 


0 
Hence we deduce that, with x = 0: 
= 1 T -r d 
Whee = 75 Je IE +/(— 8} dg, 
Ou eee reg pee 2 
a ee | e O8] iE 
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Integration by parts gives us:t 


oo 


Zoo -Ba T R sa i 
[ros een [ral wt) =o T ta pf 


ae r ED 
+ [ ree” at= o) + [ ro” Tag, 
0 0 


and similarly, 


= ee p E 
fine WEE ato froe as. 
0 0 


We assume that f(x) is continued continuously into (— œ, 0). Clearly, then 


f(+0) = f(—0)=/(), 
and 


°° 


a 
== fe Iera]; 


condition (19) becomes 


oo £2 
1 r T oe B : : 
zal MEHED RE d=, 


which is certainly satisfied if we put 
EEDE E =h [fc t) + 


or, using for the present the notation 
PE =F (8); P(E) = f (—4), 
if we determine the unknown function ®($) from the differential equation 


D (E) +hD(E) =f (6)—hf() (€>0), 
We find by integrating this equation: 


é 
D (E) =e" fe + J e [r (6) —ht 8] asl. 


We find the constant C by setting £ = 0: 
C = ® (0) = f (0), 


t We assume that e~ 7/427 f(¢) — 0 as E> œ. 
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and since 


J = 
J treat = 4) — i ol f (E) dé = e" f (8) — F (0) — h j oM 7 (8) d 
é 


we have 


£ 
F (— E) = D (£) = f (Ẹ) — 2he™t? i ef #(8) d 


Substitution of this expression for f(— ¢) in (20) gives us the final solution of 
our problem. We remark that it follows from the last expression that, as +0, 
f(—0) = f(+0), i.e. f(x) is continued continuously into (— œ, 0), as we assumed 
above. 

If, for example, the initial temperature is constant: 


į (x) = Uo z>0, 
we have 


x 
f(—2) = uo —2he~* f Uy ef da = Uy 207" _ 1), 


0 
and (20) gives 
: [je oe 
u (x,t) ee fe 4a2t at — fe A ge 
2a Yt F ò 
EAP ng 
+2 fe att ath. (21) 

0 


It may readily be shown by the reader that this solution can be expressed 
in terms of the function 


O (x) = =a fozas 
0 


as follows: 


= -2 ah't+hx I= ad 
u (x, t) wo {527} tee | EAT 


A simpler result is obtained in the case when there is no radiation at the 
end « = 0, this being held at a temperature of 0°. We now have the boundary 
condition 


+ ah ri)|. (22) 


u |p = 0, (23) 


which can be obtained from (19) by dividing by k then passing to the limit with 
h— œ. The solution can be found from (22) by letting h — œ, but a simpler 
method is to continue f(x) directly into (— œ, 0) in such a way as to satisfy 
a 
aat 
uU hag = 7e Oo E ory $ dé =0, 
ie sara V+ a] 
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which simply means putting 


HC $) = — Fé), 


i.e. continuing f(x) in an odd manner. 
We now obtain instead of (20): 


(x-)? (x+)? 


meS z frole” a E Jas, (24) 


(0) 


and if 


u |i=0 = $ (£) = uo. 
the solution becomes 


x 


2aVt 
wg@naa f etat=we (i. (25) 
~ yt 


We now consider a rod bounded at the end x = 0 and subjected to a given tem- 
perature u = p(t) at this end. 
We first suppose that the initial temperature is 0°, i.e. 


u Jimo = 0, (26) 
and we start with the particular case g(t) = 1, i.e. 
U juo =l. (27) 


We can easily find the solution of equation (5) satisfying conditions (26) 
and (27). We do this by setting 


u=v+1; 
function v is likewise a solution of (5), but has to satisfy the conditions 


Pleo =93 Cmo =— l, 


so that v and therefore u is obtained directly from (25) by putting u, = —1 
there: 
v(m t) = e| z ) and ulm t) =1 o| z (28) 
i 2a Vi 2a yt 


We now find the temperature distribution if the end v = 0 is held at 0° 
up to the instant + then raised to 1°. Let the distribution be denoted by u,(z, t). 
We obviously have u, = 0 to the instant ¢ = 1; thereafter u, is the same as 
the solution found above if we agree to reckon ¢ from r instead of 0, i.e. if we 
put ¢ — z for ¢ in (28); this gives us 


0 for t<t 


U, (x,t) = 
r (x, t) 1-9 (5) i>r. 
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But it is now clear that, if the end v = 0 is held at a temperature of 1° 
only during the interval (r, r + dr) whilst being 0° throughout the rest of the 
time, the corresponding distribution becomes 


du, 
ðr 


whereas if it were subjected to a temperature p(t) instead of 1° during the interval 
(zt, = + dr), the solution would be 


Uy (T, t) —Upsas (T, t) = — dr; 


-p (1) eae, 


hence it is evident that, if the end x = 0 is subject to the temperature 9(r) 
for all t > 0, we obtain the full effect as r varies from 0 to t by summing all 
the elementary effects, which gives us the required solution in the form 


t 


u (2,1) =— [9 (7) Sear, 

0 

or, since we have for t > Tt: 
x 

2a Yt-r x? 

iot eee uses a 
ar a  \%ayt—t ar Vn 3 2a Ya (t — q)” 
the solution is finally 
t x2 
u (x, t) = 2 (20o a dr. (29) 


2a Vn P (t — r)? 


We evidently get the solution which satisfies, in addition to the boundary 
condition 


U [xmo = P (t) 
an initial condition of the general type 
U limo = Í (2), 


[nstead of the particular condition (26)] simply by adding to solution (29) the 
solution that we previously obtained, given by (24). 


206. Rods bounded at both ends. We examine one of the most 
typical cases, when the temperature at the end z = 0 is 0°: 
[xno == 93 (30) 


whilst heat is radiated from the end x = linto the surrounding medium 
with zero temperature: 


= —hu| (31) 


xml came f 
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and the initial temperature is 
ulmo =f (2) 0<r<D. (32) 


This problem is solved very simply by Fourier’s method. 
Since boundary conditions are present here, we subject the solution 
found above: 


eat X (x) = e77% [A cos Ax + B sin Ax] (33) 


to conditions (30) and (31), which give us 
X(0)=0, ie A=0; X'(D=—hX()), 


~ y 


N 
~ 
NS 
N 
N 
Ni 
™~. 
SSI 
~ 

“Ws -2 -V 


v Ve v. 
ONY v 
SND 
Tee 
| ! 
| | a 


‘N 


Fie. 143 


whence we obtain, neglecting the constant factor B: 


X (x) = sin Ax (34) 

and 
Acos Al = — hsindl. (35) 
On substituting Al = v, we obtain the transcendental equation 
tan v = av, wherea = -r (36) 


This equation has an infinite set of real roots (Fig. 143). We shall 
only take into account the positive roots: 


Oy, Vg, V3 sees Uns oes (37) 


corresponding to which we have an infinite set of values of A: 


Ay day day +r Am s+) Where 4, = “8, (38) 
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and to which, in turn, there corresponds an infinite set of particular 
solutions of equation (5): 


B e~% sind, x (x= 1, 2, 3,...), 


satisfying the boundary conditions. 
In order to satisfy the initial conditions, we seek u as 
u(x, t) = > B eña sin 4,2, (39) 


n=l 


and obtain with ¢ = 0: 
w| 0 = Í (x) = > B,sind,c= > B X, (x), (40) 
n=1 n=1 


where we have written X,(z) = sin 4, x. We shall prove that the 
functions X,(z) are orthogonal. 

We write down the differential equations corresponding to two of 
these functions: 


Xin (®) + Ain Xm (e) =O; Xp (2) +A Xn (x) = 0. 
On multiplying the first equation by X,,(x) and the second by X,,(z), 
subtracting the equations obtained and integrating over the interval 
(0, 2), we have: 


I 
f [X7 (2) Xn (2) — X% (2) Xm (2)] de + 
0 


+ (#2, — 2) pA (a) X, (z) dz = 0. 
0 
On integrating the first integral by parts, we get 
Kin (1) Xn (D) — Xp (D Xm (D + Xn (0) Xm (0) — Xm (0) Xp (0) + 
+ (a — an) [Xp (x) X,(z)dx=0. (41) 
But X,(z) and X,,(x) satisfy boundary conditions (30) and (31), i.e. 


Xm (0) = X, (0) = 0; Xml) = — AX, (l); 
X (D = — AX, (I). 
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In view of these equations, the term outside the integral in (41) 
vanishes, and if we bear in mind that 4 — 42 # 0 for different m 
and n, we get 


1 
X,(t)X,(x)dx=0 for mn. 

f Xn (2) Xn ( 

0 


Having established orthogonality, we proceed as usual to show 
that the coefficients B, in expansion (40) are given by: 


l l 
B, = § f (2) Xn (2) de: f X(x) da. 
ò ò 


This solves the problem of expanding f(x) in functions X,(z) and 
at the same time gives the solution of our main problem in the form 
of series (39). It will be shown in Vol. IV that the X,(z) obtained, 
as above, as a result of applying Fourier’s method to typical problems 
of mathematical physics form a closed system, and that, with certain 
assumptions regarding f(x), this latter can be expanded in the base 
interval as a uniformly convergent series in X,(x). It may be noted 
that, if we took boundary conditionsu = 0 for x= 0 and x = l, instead 
of (30) and (31), we should get X,(z)=sin naa/l, and arrive at the 
usual Fourier sine expansion. 

When investigating the propagation of heat in a ring, in addition 
to the boundary conditions, we have to stipulate periodicity of the 
temperature [cf. 195]. If we take the radius of the ring as unity, so that 
its total length is 22, and let z denote the length along the ring meas- 
ured from a given point, we arrive at a solution of the form 


u(x, th = i + > (a, cos ng + 6, sin nx) quant 

n=1 
where 

a 

> + > (a, 008 nx + 6, sin nz) 
n=l 

is the Fourier series for the initial temperature distribution f(x) 
on the ring. 


Sufficient conditions for the series obtained for u(x, t) to be the ac- 
tual solution of our problem will be found in Vol. IV. 


207. Supplementary remarks. We take the general equation for thermal 
conduction: 


—cv, (42) 
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obtained on the assumption of radiation from the total surface of the rod to 
the surrounding medium, the temperature of which is taken as zero. 

We can easily verify that equation (42) reduces to equation (5) for u by 
the simple substitution 


In the case of an infinite rod with zero initial temperature, i.e. with the condi- 
tion u = 0 for t = 0, the non-homogeneous equation 


ðu Ou 
has a solution of the form 
tto -02 
u (x, t) = | | F (£, t) Bas hie cs e aan 7) dé dr. (44) 
62 2a Vn (t — t) 


It can be found either by the same method as was used in [174] for the non- 
homogeneous wave equation, or by superposition of the basic elementary solu- 
tion (13), in which we replace ¢ by (t — t) then multiply by F(é, t) and integrate, 
from — œ to +œ with respect to ¢, and from 0 to ¢ with respect to r. The 
physical significance of these operations is obvious. The solution of (43) is found 
by means of superposition of sources, distributed throughout the rod with 
intensity F(¢, t), and starting to be effective at the instant 7. Superposition 
of the sources is also carried out with respect to time. 

Fourier’s method is applied to two and three-dimensional cases exactly as 
for the wave equation, except that the time-dependent factor is now an 
exponential function. 

For instance, the equation 

D ohe 
a ðr? ' Oy? 
leads in the case of a rectangular lamina to a solution of the form 
u = e72 U (x, y), (45) 


where we have written w? in the exponent so as to be able to use the expressions 
of [177]. Let the boundary condition be u = 0 on C and the initial condition 
u = p(x, y) for t = 0. The solution takes the form of the series: 


2° a 
—wg,rt . ONL , TH 
us Y agpo "* sin —— sin zy ; 
o,t=1 


where the 3 , are given by (113) of [177], and the a,, by the first of formulae 
(114). 

In the case of a circular lamina [cf. 178}, substitution (45) leads to the fol- 
lowing solution: 


oo 2 had 2 
= t - t. 
u= > anne mn cos nb, (kr) + D Bm,n® m.n sin nad p (kR r)o 
=0 n=l 
=1 m=1 
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where the am,n and Êm; p are given by the same formulae as the aD, and 
(i), of [178], and the m» n by (128). 


208. The case of a sphere. We give a parallel treatment of the wave equation 
and thermal conduction equation for a sphere: 


2. 
Tu =atau; (46) 
& =0@ sv, (47) 


with the assumption that the initial data depend only on the distance r of a 
point from the centre of the sphere: 


a 
ul = 9, (r); r = G2 (r); (48) 
t fmo 


V lemo = Y (r). (49) 


We take the boundary conditions: 


M o r=R; (50) 
2 L w=o r=R, (51) 


where R is the radius of the sphere and h > 0. In view of the central symmetry, 
the solution will be independent of the polar angle and will be a function of 
r and £ only. On setting: 


u = (A cos wt + B sin wt) U (r) ; (52) 
v= Ae—o*t V (r), (53) 
we obtain the same equation AW + k?W = 0 for U(r) and V(r), where k? = 


= w?ja?. On writing Laplace’s operator in spherical coordinates and using the 
fact that W depends only on r, we get: 


l d aw i aw 2 dw d 
war (a) TBM a0, ie Gaty a tW =o. 


We bring in a new unknown instead of W, given by 
R(r)=rW (r). 


On substituting in the equation for W, we get the equation for R : R”(r) + 
-+ k? R(r) = 0, whence R(r) = C, cos kr + C, sin kr, and consequently: 


cos kr +0, sin kr : 


W (r)=C, P F 
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In view of the solution remaining finite at the centre of the sphere, i.e. with 
r= 0, we must write C, = 0, and substitution in (52) gives us the solution 
as 


u = (A cos wt + B sin œt) si ; (54) 


v= Ae—ot Sinker | (55) 
Boundary conditions (50) and (51) give us the constant k, and consequently, 
w = ak. 
Application of the second of these to (sin kr)/r results in the following equa- 
tion for k: 
kRootkR=1—hR. (56) 
With k = 0, we arrive at the equation obtained from boundary condition 
(50): 
tan kR =kR. (57) 
On putting kR = v, we see that equations (56) and (57) are analogous to 
equation (36). Let k,, k,, ... be the positive roots of (56). On using (55), we 
get for v(r, t): 


kia — ak2 i 
v (r,t) = > ane akat sin kar (58) 
n=1 r 
Initial condition (49) gives: 
ry (r) = X a, sink,r. (59) 
n=1 


Precisely as in [206], the functions sin k, r are orthogonal in the interval 
(0, R), and the coefficients of expansion (59) are therefore given by: 


R R 
a,=JSryp(r)sink,rdr: f sin? kyrdr. 
0 0 
Turning to the equation for u, we first write kp (n = 1, 2, ...) for the positive 


roots of equation (57). We have to include here the root k = 0, corresponding 
to a zero frequency w. With this, we have to write A + Bt instead of (A cos wt + 
+ B sin wt), the equation for R(r) becomes R’(r) = 0, and W(r) = Rir)/r 
is constant, so that the corresponding solution of equation (46) becomes a, + 
+ b, t. This clearly satisfies boundary condition (50) for any values of constants 
a, and b,. We finally get for u: 


u (r, t) =a + bot + > (ap cos ak,t + b, sin kpt) sin kat 
n=! 


On differentiating with respect to t and setting t = 0, we get the expansions 
for the functions appearing in initial conditiosu (48): 


TQ, (r) = aor + > ansin kar; rp, (r) = bor + ez kpbn Sin kyr. 


n=l n=1 
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It may easily be seen by using (57) that the sin k, r are orthogonal to r in 
the interval (0, R) as well as to each other, i.e. 


R R 
frsink,rdr=0 and f sinkgrsink,rdr=0, for m#n, 
0 ie) 


and the coefficients in the last expansions are given by the usual rule: 


R R 3 R 
ag = fre. (r)dr : [ = dr = Re [ro (r) dr, 
6 0 


R R 
an = | ry, (r) sin kpr dr : | sint kyr dr. 
0 ò 

Similar expressions are found for coefficients b,. We remark that the solution 
v = const. is obtained for equation (47) with w = 0, but this solution does not 
satisfy boundary condition (51), since h > 0 by hypothesis. 

We can interpret (46) as the equation for the velocity potential u of the 
vibrations of a gas, where boundary condition (50) expresses the fact that the 
velocity of a gaseous particle situated on the surface of the sphere has a zero 
normal component. 

Boundary condition (51) for the thermal conduction equation (47) expresses 
the fact that the surface of the sphere radiates to the surrounding space, the 
temperature of which is held at zero. 


209. The uniqueness theorem. We now pass to the question of the 
uniqueness of the solution of the thermal conduction equation with 
given initial and boundary conditions [cf. 179]. We take the one- 
dimensional problem, i.e. the equation: 

2 

noet (60) 
for a finite rod, C< x <l. We draw a domain G on the zt plane, 
bounded by the lines x = 0, v = l, and situated above the segment 
0 < x < lof the v axis (Fig. 144). We also draw a linet = tọ tọ > 0, 
parallel to the x axis, and cutting a finite rectangle OAQP from the 
domain G, the rectangle being denoted simply by H. We prove the 
following theorem: 

THEOREM. Let the function u(x, t) satisfy equation (60) inside G and 
be continuous as far as the contour of G. The greatest and least values 
of u(x, t) in H are now attained on the part lof the contour of H, formed 
by sides OP, PA and AQ. 

We confine the proof to the greatest value and assume the theorem 
false. Let the greatest value of u(z, t) be attained in H or within the 


622 THE PARTIAL DIFFERENTIAL EQUATIONS OF MATHEMATICAL PHYSICS {209 


side PQ at the point (x, y’), so that the greatest value on 1 is less than 
this first greatest value M. We construct a new function v(x, t) as 


follows: 
v(x, t) = u (x, t) — k (t — t). (61) 


where k is a positive number which we now fix. We have in the rect- 
angle H: 
i u(x,t) <v (x, t) < u (x, t) + kta, 


and k can be fixed near enough zero for the greatest value of v(x, t) 
on 1 to be, as in the case of u(x, t) also, less than the value of v(x, t) 


Fie. 144 


at the point («’,¢’). With this choice of k, v(x, t) will take its greatest 
value in H inside H or within the side PQ, and not on l. We consider 
these cases separately and prove that each leads to a contradiction. 

Let v(x, t) take its greatest value at a point C(2,, t,) situated inside H. 
The fact that we have a maximum of v(x, t) at C implies that, at this 
point [I, 58]: 


whence it follows that 


or, by (61): 
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But u satisfies equation (60) inside G, and the inequality written 
leads to the absurd result: —k > 0. We now let v(x, t) attain its 
greatest value in H at a point M(a,, to) situated within the side PQ. 
On considering the variation of v(x, t) along N, N, parallel to the t 
axis, we arrive at the inequality dv/at > 0 at N, inasmuch as the 
value of v(x, t) at N is not less than its values throughout N, N. 
By now considering the variation of v(x, t) along PQ, we arrive at the 
inequality 67v/dz? < 0 at the point N, since v(x, tọ) has a maximum 
at N (x = v). Thus dv/dt — a? d°v/da?>0 at N, and we arrive at the 
same contradiction as above, so that the theorem is proved. 

It immediately follows from this theorem that, if u(x, t) vanishes 
all along the contour l, u(x, t) also vanishes throughout the rectangle 
H, which leads us very simply to the uniqueness theorem. 

Suppose that, in addition to equation (60), we have the following 
initial and boundary conditions (specifying the temperature at the 
ends): 


U limo =f (8) (OSH <I; Upp = 0 (t); Uleni =O, (t). (62) 


These conditions amount to specifying u(x, t) on the piece J of the 
contour of G. We assume that these boundary values represent 
functions continuous throughout the contour of G, including points 
O and A, i.e. w(0) = f(0) and w,(0) = f(J). With conditions (62), let 
there exist inside G two solutions of equation (60), u (x, t) and u(x, t), 
continuous as far as the contour of G. Their difference, u(x, t) 
= U,(x,t) — u(x, t), is now a solution of (60), equal to zero on l. 
It follows at once from the theorem proved above that u is zero every- 
where inside G, i.e. u(x, t) coincides with u(x, t). We remark that the 
uniqueness theorem is preserved if, instead of demanding continuity 
of u(x,t) at points O and A, we only require boundedness of the 
function in the neighbourhood of these points. In this case, the bound- 
ary values likewise do not need to be continuous at O and A. 

The solution for an infinite rod is given by (12). We suppose that 
the given function f(x) is continuous and vanishes outside the segment 
(—b, +6), so that 


l +b R (§—x)* 
4a't 
z J fée dé. 


U(x, t) = 


2a 


It is easily shown by using this formula that u(x, t)—> 0 uniformly 
with respect to t as r — + or x —> — ©, i.e. for any given positive € 
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there exists a positive M such that | u(x, t) | < e for læ] >N and 
for any positive t. We prove that only one solution exists with this 
property for a given initial condition (6). It is sufficient to show, as 
above, that u(x, t) takes its greatest and least values on the z axis. 
We do this by reductio ad absurdum. Let u(x, t) take its greatest value 
M at a point C(z, t), where ¢, > 0, i.e. f(x) < M in the interval 


t c 


Fie. 145 


—œ < x <+. In view of the fact that f(x) = 0 outside the interva 
(—b, b), we can say that M > 0. We draw the two lines z = d and 
x = —d, choosing d sufficiently large for the inequality | u(x, t) | <M 
to be valid on these lines, then we draw the rectangle H formed by 
these lines, the x axis, and the straight line through C parallel to the 
x axis (Fig. 145). The function u(x, t) has a greater value at C than on 
the piece Z of the contour of H, consisting of the three sides: x = d, 
xz = —d and t = 0. Thus u(x,t) attains its greatest value in regard 
to H either inside H or within the side passing through C, and this 
leads to a contradiction, as above. Hence we have proved the unique- 
ness of the solution having the property stated above and with the 
assumptions made regarding /(2). 
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